arXiv: 1507.00317v3 [cs.SI] 5 Nov 2015 


Prom Competition to Complementarity: 
Comparative Influence Diffusion and Maximization* 

Wei Lu^ Wei Chen^ Laks V.S. Lakshmanan'l^ 


Wniversity of British Columbia 
Vancouver, B.C., Canada 
{welu, lakshQcs.ubc.ca 


l-Microsoft Research 
Beijing, China 
weicOmicrosoft.com 


November 6, 2015 


Abstract 

Influence maximization is a well-studied problem that asks for a small set of influential users from 
a social network, such that by targeting them as early adopters, the expected total adoption through 
influence cascades over the network is maximized. However, almost all prior work focuses on cascades of a 
single propagating entity or purely-competitive entities. In this work, we propose the Comparative Inde¬ 
pendent Cascade (Com-IC) model that covers the full spectrum of entity interactions from competition to 
complementarity. In Com-IC, users’ adoption decisions depend not only on edge-level information prop¬ 
agation, but also on a node-level automaton whose behavior is governed by a set of model parameters, 
enabling our model to capture not only competition, but also complementarity, to any possible degree. 
We study two natural optimization problems. Self Influence Maximization and Complementary Influence 
Maximization, in a novel setting with complementary entities. Both problems are NP-hard, and we de¬ 
vise efficient and effective approximation algorithms via non-trivial techniques based on reverse-reachable 
sets and a novel “sandwich approximation” strategy. The applicability of both techniques extends beyond 
our model and problems. Our experiments show that the proposed algorithms consistently outperform 
intuitive baselines on four real-world social networks, often by a significant margin. In addition, we learn 
model parameters from real user action logs. 


*An abridged version of this article is to appear in the Proceedings of VLDB Endowment (PVLDB), Volume 9, No. 2, and the 
42nd International Conference on Very Large Data Bases (VLDB 2016), New Delhi, India, September 5-9, 2016. The copyright 
of the PVLDB version is held by the VLDB Endowment. 
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1 Introduction 


Online social networks are ubiquitous and play an essential role in our daily life. Fueled by popular appli¬ 
cations such as viral marketing, there has been extensive research in influence and information propagation 
in social networks, from both theoretical and practical points of view. A key computational problem in this 
field is influence maximization, which asks to identify a small set of k influential users (also known as seeds) 
from a given social network, such that by targeting them as early adopters of a new technology, product, 
or opinion, the expected number of total adoptions triggered by social influence cascade (or, propagation) 
is maximized [8,15]. The dynamics of an influence cascade are typically governed by a stochastic dijfusion 
model, which specifies how adoptions propagate from one user to another in the network. 

Most existing work focuses on two types of diffusion models — single-entity models and pure-competition 
models. A single-entity model has only one propagating entity for social network users to adopt: the classic 
Independent Cascade (IC) and Linear Thresholds (LT) models [15] belong to this category. These models, 
however, ignore complex social interactions involving multiple propagating entities. Considerable work has 
been done to extend IC and LT models to study competitive influence maximization, but almost all models 
assume that the propagating entities are in pure competition and users adopt at most one of them [1,3,5- 
7,13,16,21]. 

In reality, the relationship between different propagating entities is certainly more general than pure 
competition. In fact, consumer theories in economics have two well-known notions: substitute goods and 
complementary goods [22]. Substitute goods are similar ones that compete, and can be purchased, one in 
place of the other, e.g., smartphones of various brands. Complementary goods are those that tend to be 
purchased together, e.g,. iPhone and its accessories, computer hardware and software, etc. There are also 
varying degrees of substitutability and complementarity: buying a product could lessen the probability of 
buying the other without necessarily eliminating it; similarly, buying a product could boost the probability 
of buying another to any degree. Pure competition only corresponds to the special case of perfect substitute 
goods. 

The limitation of pure-competition models can be exposed by the following example. Consider a viral 
marketing campaign featuring iPhone 6 and Apple Watch. It is vital to recognize the fact that Apple Watch 
generally needs an iPhone to be usable, and iPhone’s user experience can be greatly enhanced by a pairing 
Apple Watch (see, e.g., http://bit.ly/lGOqesc). Clearly none of the pure-competition models is suitable 
for this campaign because they do not even allow users to adopt both the phone and the watch! This 
motivates us to design a more powerful, expressive, yet reasonably tractable model that captures not only 
competition, but also complementarity, and to any possible degrees associated with these notions. 

To this end, we propose the Comparative Independent Cascade model, or Com-IC for short, which, unlike 
most existing diffusion models, consists of two critical components that work jointly to govern the dynamics 
of diffusions: edge-level information propagation and a Node-Level Automaton (NLA) that ultimately makes 
adoption decisions based on a set of model parameters, known as the Clobal Adoption Probabilities (GAPs). 
Of these, edge-level propagation is similar to the propagation captured by the classical IC and LT models, 
but only controls information awareness. The NLA is a novel feature and is unique to our proposal. Indeed, 
the term “comparative” comes from the fact that once a user is aware, via edge-level propagation, of multiple 
products, intuitively she makes a comparison between them by “running” her NLA. Notice that “comparative” 
subsumes “competitive” and “complementary” as special cases. In theory, the Com-IC model is able to 
accommodate any number of propagating entities (items) and cover the entire spectrum from competition 
to complementarity between pairs of items, reflected by the values of GAPs. 

In this work, as the first step toward comparative influence diffusion and viral marketing, we focus on 
the case of two items. At any time, w.r.t. any item A, a user in the network is in one of the following 
four states: A-idle, A-suspended, A-rejected, or A-adopted. The NLA sets out probabilistic transition rules 
between states, and different GAPs are applied based on a given user’s state w.r.t. the other item B and the 
relationship between A and B. Intuitively, competition (complementarity) is modeled as reduced probability 
(resp., increased probability) of adopting the second item after the first item is already adopted. After a user 
adopts an item, she propagates this information to her neighbors in the network, making them aware of the 
item. The neighbor may adopt the item with a certain probability, as governed by her NLA. 
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We then define two novel optimization problems for two complementary items A and B. Our first problem, 
Self Influence Maximization ('SelfInfMax^, asks for k seeds for A such that given a fixed set of ;B-seeds, the 
expected number of ^-adopted nodes is maximized. The second one, Complementary Influence Maximization 
( CompInfMax ), considers the flip side of SelfInfMax: given a fixed set of .A-seeds, find a set of k seeds 
for B such that the expected increase in ^-adopted nodes thanks to B is maximized. To the best of our 
knowledge, we are the first to systematically study influence maximization for complementary items. 

We show that both problems are NP-hard under Com-IC. Moreover, two important properties, submod¬ 
ularity and monotonicity (see §2), which would allow a greedy approximation algorithm frequently used for 
influence maximization, do not hold in unrestricted Com-IC model. Even when we restrict Com-IC to mutual 
complementarity, submodularity still does not hold in general. 

To circumvent the aforementioned difficulties, we first show that submodularity holds for a subset of 
the complementary parameter space. We then make a non-trivial extension to the Reverse-Reachable Set 
(RR-set) techniques [2,23,24], originally proposed for influence maximization with single-entity models, to 
obtain effective and efficient approximation solutions to both SelfInfMax and CompInfMax. Next, we 
propose a novel Sandwich Approximation (SA) strategy which, for a given non-submodular set function, 
provides an upper bound function and/or a lower bound function, and uses them to obtain data-dependent 
approximation solutions w.r.t. the original function. We further note that both techniques are applicable 
to a larger context beyond the model and problems studied in this paper: for RR-sets, we provide a new 
definition and general sufficient conditions not covered by [2,23,24] that apply to a large family of influence 
diffusion models, while SA applies to the maximization of any non-submodular functions that are upper- 
and/or lower-bounded by submodular functions. 

In experiments, we first learn GAPs from user action logs from two social networking sites - Flixster.com 
and Douban.com. We demonstrate that our approximation algorithms based on RR-sets and SA techniques 
consistently outperform several intuitive baselines, typically by a significant margin on real-world networks. 

To summarize, we make the following contributions: 

• We propose the Com-IC model to characterize influence diffusion dynamics of products with arbitrary 
degree of competition or complementarity, and identify a subset of the parameter space under which 
submodularity and monotonicity of influence spread hold, paving the way for approximation algorithms 
(§3 and §5). 

• We propose two novel problems - Self Influence Maximization and Complementary Influence Maxi¬ 
mization - for complementary products under the Com-IC model (§4). 

• We show that both problems are NP-hard, and devise efficient and effective approximation solutions by 
non-trivial extensions to RR-set techniques and by proposing Sandwich Approximation, both having 
applicability beyond this work (§6). 

• We conduct empirical evaluations on four real-world social networks and demonstrate the superiority 
of our algorithms over intuitive baselines, and also propose a methodology for learning global adoption 
probabilities for the Com-IC model from user action logs of social networking sites (§7). 

For better readability, most of the proofs, as well as some additional theoretical and experimental results 
are presented in the appendix. 


2 Background & Related Work 

Given a graph G = (V, E,p) where p : if —[0,1] specifies pairwise influence probabilities (or weights) between 
nodes, and k G Z_|_, the influence maximization problem asks to find a set S' C P of fc seeds, activating which 
leads to the maximum expected number of active nodes (denoted cr(S)) [15[. Under both IC and LT models, 
this problem is NP-hard; Chen et al. [9,10] showed computing cr(S) exactly for any S C U is ^/P-hard. 
Fortunately, cr(-) is a submodular and monotone function of S for both IC and LT, which allows a simple 
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Figure 1: Com-IC model: Node-level automaton for product/item A. 


greedy algorithm with an approximation factor of 1—1/e—e, for any e > 0 [15,20]. A set function / : 2^ ^ K>o 
is submodular if for any S CT CU and any x G U\T, f{SU{x}) — f{S) > f{TU{x}) — T{S), and monotone 
if f{S) < f{T) whenever S C T C U. Tang et al. [23,24] proposed new randomized approximation algorithms 
which are orders of magnitude faster than the original greedy algorithms in [15[. 

In competitive influence maximization [1,3,5-7,13,16,21] (also surveyed in [8]), a common thread is the 
focus on pure competition, which only allows users to adopt at most one product or opinion. Most works 
are from the follower’s perspective [1,5,6,13], i.e., given competitor’s seeds, how to maximize one’s own 
spread, or minimize the competitor’s spread. Lu et al. [16] aims to maximize the total influence spread of all 
competitors while ensuring fair allocation. 

For viral marketing with non-competing items, Datta et al. [11] studied influence maximization with items 
whose propagations are independent. Narayanam et al. [19] studied a setting with two sets of products, where 
a product can be adopted by a node only when it has already adopted a corresponding product in the other 
set. Their model extends LT. We depart by defining a significantly more powerful and expressive model in 
Com-IC, compared to theirs which only covers the special case of perfect complementarity. 

Myers and Leskovec analyzed Twitter data to study the effects of different cascades on users and predicted 
the likelihood of a user adopting a piece of information given the cascades to which the user was previously 
exposed [18[. McAuley et al. used logistic regression to learn substitute/complementary relationships be¬ 
tween products from user reviews [17] . Both studies focus on data analysis and behavior prediction and do 
not provide diffusion modeling for competing and complementary items, nor do they study the influence 
maximization problem in this context. 


3 Comparative Independent Cascade Model 

Review of Classical IC Model 

In the IC model [15], there is just one entity (e.g., idea or product) being propagated through the network. 
An instance of the model has a directed graph G = {V,E,p) where p : if —)■ [0,1], and a seed set S C V. 
For convenience, we use for p{u, v). At time step 0, the seeds are active and all other nodes are inactive. 
Propagation proceeds in discrete time steps. At time t, every node u that became active at f — 1 makes one 
attempt to activate each of its inactive out-neighbors v. This can be seen as node u “testing” if the edge 
{u, v) is “live” or “blocked”. The out-neighbor v becomes active at t iff the edge is live. The propagation ends 
when no new nodes become active. 

Key differences from IC model 

In the Comparative IC model (Com-IC), there are at least two products. For ease of exposition, we focus 
on just two products A and B below. Each node can be in any of the states {idle, suspended, adopted, 
rejected} w.r.t. each of the products. All nodes are initially in the joint state of (A-idle, ;B-idle). One of the 
biggest differences between Com-IC and IC is the separation of information diffusion (edge-level) and the 
actual adoption decisions (node-level). Edges only control the information that flows to a node: e.g., when u 
adopts a product, its out-neighbor v may be informed of this fact. Once that happens, v uses its own node 
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level automaton (NLA) to decide which state to transit to. This depends on u’s current state w.r.t. the two 
products as well as parameters corresponding to the state transition probabilities of the NLA, namely the 
Global Adoption Probabilities, defined below. 

A concise representation of the NLA is in Figure 1. Each state is indicated by the label. The state diagram 
is self-explanatory. E.g., with probability g^| 0 , a node transits from a state where it’s A-idle to A-adopted, 
regardless of whether it was S-idle or S-suspended. 

From the A-suspended state, it transits to A-adopted w.p. pj^ and to A-rejected w.p. 1 — pj\. The 
probability called reconsideration probability, as well as the reconsideration process will be explained 
below. Note that in a Com-IC diffusion process defined below, not all joint state is reachable from the initial 
(A-idle, ,B-idle) state, e.g., (A-idle, ,B-rejected). Since all unreachable states are irrelevant to adoptions, they 
are negligible (details in Appendix). 

Global Adoption Probability (GAP) 

The Global Adoption Probabilities (GAPs), consisting of four parameters Q = ( 9 . 410 , 9 ^|b, 9 b| 0 , 9 b|. 4 ) G [0,1]^ 
are important parameters of the NLA which decide the likelihood of adoptions after a user is informed of an 
item. 9 _ 4|0 is the probability that a user adopts A given that she is informed of A but not ;B-adopted, and 
9_4|B is the probability that a user adopts A given that she is already yB-adopted. A similar interpretation 
applies to 9b 1 0 and 931.4. 

Intuitively, GAPs reflect the overall popularity of products and how they are perceived by the entire 
market. They are considered aggregate estimates and hence are not user specific in our model. We provide 
further justifications at the end of this section and describe a way to learn GAPs from user action log data 
in §7. 

GAPs enable Gom-IG to model competition and complementarity, to arbitrary degrees. We say that A 
competes with B iff 9b|.4 < 9b|0- Similarly, A complements B iff 931.4 > 9b|0- We include the special case of 
9 b|^ = 93|0 in both cases above for convenience of stating our technical results, and it actually means that 
the propagation of B is completely independent of A (c/. Lemma 3). Gompetition and complementarity in the 
other direction are similar. The degree of competition and complementarity is determined by the difference 
between the two relevant GAPs, i.e., | 93 |^ — 9310 ] and I 9.413 — 9 ^| 0 |. For convenience, we use Q+ to refer to 
any set of GAPs representing mutual complementarity: ( 9.410 < 97113 ) A (9310 < 9 b| 7 i), and similarly, for 
GAPs representing mutual competition: (97110 > 971|b) A (9310 > 9B|7t)- 

Diffusion Dynamics 

Let G = (V,E,p) be a directed social graph with pairwise influence probabilities. Let Sj\^,Sb C P be the 
seed sets for A and B. Influence diffusion under Gom-IG proceeds in discrete time steps. Initially, every 
node is A-idle and B-idle. At time step 0, every u £ becomes A-adopted and every u £ Sb becomes 
B-adopted^. If u G H Sb, we randomly decide the order of u adopting A and B with a fair coin. For ease 
of understanding, we describe the rest of the diffusion process in a modular way in Figure 2. We use N'^{v) 
and N~{v) to denote the set of out-neighbors and in-neighbors of v, respectively. 

We draw special attention to tie-breaking and reconsideration. Tie-breaking is used when a node’s in¬ 
neighbors adopt different products and try to inform the node at the same step. Node reconsideration concerns 
the situation that a node v did not adopt A initially but later after adopting B it may reconsider adopting 
A: when B competes with A (97110 > 971|b)) will not reconsider adopting A, but when B complements 
A (specifically, 97110 < 97113), v will reconsider adopting A. In the latter case, the probability of adopting 
A, PA, is defined in such a way that the overall probability of adopting A is equal to 97113 (since 97113 = 
9.4|0 + (1 - < iAI @) PA )- 

^ No generality is lost in assuming seeds adopt an item without testing the NLA: for every v G V, we can create two dummy 
nodes and edges and (v^^v) with Pvjs^,v = Pvq,v = 1- Requiring seeds to go through NLA is equivalent to 

constraining that A^-seeds {B-seeds) be selected from all (resp. u^’s). 
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Global Iteration. At every time step t > 1, for all nodes that became A- or S-adopted at t — 1, their 
outgoing edges are tested for transition (1 below). After that, for each node v that has at least one 
in-neighbor (with a live edge) becoming A- and/or ;B-adopted at t — 1 , u is tested for possible state 
transition (2-4 below). 

1. Edge transition. For an untested edge (u,v), flip a biased coin independently: (u,v) is live w.p. 
Pu,v and blocked w.p. 1 — Pu,v Each edge is tested at most once in the entire diffusion process. 

2. Node tie-breaking. Consider a node v to be tested at time t. Generate a random permutation tt 
of v’s in-neighbors (with live edges) that adopted at least one product at t—1. Then, test v with each 
such in-neighbor u and u’s adopted item (A and/or B) following tt. If there is a w G N~{v) adopting 
both A and B, then test both products, following their order of adoption by w. 

3. Node adoption. Consider the case of testing an A-idle node v for adopting A (Figure 1). If u is not 
;B-adopted, then w.p. g_ 4 | 0 , it becomes A-adopted and w.p. 1 — g_ 4|0 it becomes A-suspended. If v is 
;B-adopted, then w.p. g_ 4 |B, it becomes A-adopted and w.p. 1 — it becomes A-rejected. The case 
of adopting B is symmetric. 

4. Node reconsideration. Consider an A-suspended node v that just adopts B at time t. Define 
PA = niax{( 7 _ 4 |B — ( 7 _ 4 | 0 ,0}/(l — 9 ^ 10 ). Then, v reconsiders to become A-adopted w.p. pA-, or A-rejected 
w.p. 1 — PA- The case of reconsidering B is symmetric. 


Figure 2: Com-IC model: diflfusion dynamics 


Design Considerations 

The design of Com-IC not only draws on the essential elements from a classical diffusion model (IC) stem¬ 
ming from mathematical sociology, but also closes a gap between theory and practice, in which diffusions 
typically do not occur just for one product or with just one mode of pure competition. With GAPs in the 
NLA, the model can characterize any possible relationship between two propagating entities: competition, 
complementarity, and any degree associated with them. GAPs are fully capable of handling asymmetric 
relationship between products. E.g., an Apple Watch (A) is complemented more by an iPhone {B) than the 
other way round: many functionalities of the watch are not usable without a pairing iPhone, but an iPhone 
is totally functional without a watch. This asymmetric complementarity can be expressed by any GAPs 
satisfying (cia\b ~ 9 . 4 | 0 ) > ~ 9 b| 0 ) ^ 0- Furthermore, introducing NLA with GAPs and separating the 

propagation of product information from actual adoptions reflects Kalish’s famous characterization of new 
product adoption [14]: customers go through two stages - awareness followed by actual adoption. In Kalish’s 
theory, product awareness is propagated through word-of-mouth effects; after an individual becomes aware, 
she would decide whether to adopt the item based on other considerations. Edges in the network can be 
seen as information channels from one user to another. Once the channel is open (live), it remains so. This 
modeling choice is reasonable as competitive goods are typically of the same kind and complementary goods 
tend to be adopted together. 

We remark that Com-IC encompasses previously-studied single-entity and pure-competition models as 
special cases. When qA \^ = 9 b |0 = 1 and qA\B = <1 B\A = 0> Com-IC reduces to the (purely) Competitive 
Independent Cascade model [8]. If, in addition, gB|0 is 0, the model further reduces to the classic IC model. 

4 Formal Problem Statements 

Many interesting optimization problems can be formulated under the expressive Com-IC model. In this 
work, we focus on influence maximization with complementary propagating entities, since competitive viral 
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marketing has been studied extensively (see §2). In what follows, we propose two problems. The first one, 
Self Influence Maximization (SelfInfMax), is a natural extension to the classical influence maximization 
problem [15]. The second one is the novel Complementary Influence Maximization (CompInfMax), where 
the objective is to maximize complementary effects (or “boost” the expected number of adoptions) by selecting 
the best seeds of a complementing good. 

Given the seed sets S_a,Sis, we first define crj^{SA, Sb) and crsiSj^, Sb) to be the expected number of 
^-adopted and S-adopted nodes, respectively under the Com-IC model. Clearly, both cryi and (Tg are real¬ 
valued bi-set functions mapping 2^ x 2^ to [0, |y|], for any fixed Q. Unless otherwise specified, GAPs are not 
considered as arguments to a a and erg as Q is constant in a given instance of Com-IC. Also, for simplicity, 
we may refer to cr_ 4 (-,-) as A-spread and crg(-,-) as ,B-spread. The following two problems are defined in 
terms of A-spread, without loss of generality. 

Problem 1 (SelfInfMax). Given a directed graph G = {V^E,p) with pairwise influence probabil¬ 
ities, B-seed set Sb C V, a cardinality constraint k, and a set of GAPs Q^, find an A-seed set 
G V of size k, such that the expected number of A-adopted nodes is maximized under Com-IC: 

e argmaxj.gv,|T|=fcO'A('r, S'g). 

SelfInfMax is obviously NP-hard, as it subsumes InfMax under the classic IC model when S'g = 0 
and ( 7 _ 4|0 = g^|g = 1. By a similar argument, it is ^(tP-hard to compute the exact value of cr_ 4 (S'_/i, S'g) and 
<^b{Sa,Sb) for any given Sa and Sb- 

Problem 2 (CompInfMax). Given a directed graph G = (V,E,p) with pairwise influence probabilities, 
A-seed set Sa G V, a cardinality constraint k, and a set of GAPs Q^, find a B-seed set G V of 
size k such that the expected increase (boost) in A-adopted nodes is maximized under Com-IC: S^ € 
argmaxrcu,|r|=fe[f^A(-S'A,ir) - cr_4(5'^, 0)]. 

Theorem 1. CompInfMax is NP-hard. 

Proof. Let I be an instance of InfMax with the IC model, defined by a directed graph G = {V, E) and 
budget b < |U|. We define an instance J of CompInfMax as follows. For each u G U, create a dummy copy 
v' and a directed edge {v' ,v) with influence probability Pv'^v = 1. Let V be the set of all dummy nodes. Set 
Sa = V, ( 7 _ 4|0 = 0, qA\B = 9b|0 = dB\A = 1- The budget b of selecting B seeds in instance J is the same b as 
in instance I. 

Claim 1. Consider the CompInfMax instance J. For all B-seed sets Sb, there exists a set Tg C V' such 
that <ja{Sa,Tb) > (Ja{.Sa,Sb). 

Proof of Claim 1. Let u G Sg n U be any regular node. Assume for now that v' ^ Sb. Let S'g denote the 
yB-seed set obtained by replacing all such v by its corresponding dummy copy v'. Since Sa = V■, all dummy 
nodes are A-adopted and all regular nodes are A-informed. Choosing v' to be a B-seed makes v' and v both 
B-adopted, so by reconsideration v becomes A-adopted as well. Clearly, the sets of regular nodes that are 
B-adopted under the configurations {Sa,Sb) and (B^jBg) are the same. Thus, sets of regular nodes that 
are A-adopted under these configurations are also the same, which verifies the claim. 

Now suppose V, v' G Sb- We claim that there exists a dummy node outside 'S'b, which follows from the 
fact that the budget b < |U| = |U'|. Now, replace v with any dummy node u' not in Bg and call the 
resulting B-seed set Bg. By an argument similar to the above case, we can see that every regular node that 
is A-adopted under configuration (B^,Bg) is also A-adopted under configuration (B^,Bg). There may be 
additional A-adopted regular nodes thanks to the seed u' G S'^ \ Sb. Again, the claim follows. Finally, if 
Bg n U = 0 (no regular node in Bg), then the claim trivially holds as we can simply let Tg = Bg. □ 

Now consider any T G V and let cric{T) denote the expected spread of T under the IC model in G. Let 
T' = {u' G UA u G T} be the set of corresponding dummy copies. Since Sa = V■, all regular nodes (in 
V) would become A-informed when Bg = 0. Now set Bg = T', which would make all nodes in T (regular) 


7 


become yB-adopted, and then ^-adopted (by reconsideration), and further propagate both A and B over the 
network. Hence, 


criciT) = aA{SA,T') - n, 


( 1 ) 


where n =def \V\ = \V'\. 

Next, to prove the theorem, it suffices to show: 

Claim 2. The subset G V is an optimal solution to the InfMax (with IC model) instance I if and 
only if T) C V’ is an optimal solution to the CompInfMax instance J under the Com-IC model, where 
Ti = {v' GV' :vG T*}. 

Proof of Claim 2. (=>): Suppose for a contradiction that T) is suboptimal to J. Let X' be an optimal 
solution to J instead, which implies that cr_ 4 (S'_ 4 ,X') > (Ja{SaiTI). 

If X' contains only dummy copies, let X be the corresponding set of regular nodes. Then, by Eq. (1), 

(^a{Sa, X') = (Tic{X) + n > cr^(S_ 4 , T() = aiciT^) + n. 

Hence aic{X) > (Jic{TA), contradicting to the fact that T* is optimal to I. 

If X' contains at least one regular node, then by repeated application of Claim 1, we can obtain a B-seed 
set with no regular nodes and furthermore dominates X' w.r.t. ^-spread. Thus, w.l.o.g. we can ignore B-seed 
sets containing regular nodes. 

(<^^): Let T) be an optimal solution to J. By the same domination argument, we can assume w.l.o.g. 
that T) consists of only dummy nodes. Then, the optimality of T* for instance I can be argued using Eq. (1) 
in an identical manner to the above. □ 

This completes the proof that CompInfMax is NP-hard. □ 

From the formulation of CompInfMax, we can intuitively see that the placement of B-seeds will be heav¬ 
ily dependent on the existing ^-seeds. For example, if Sa and Bg are in two different connected components 
of the graph, then evidently the boost is zero. In contrast, if they are rather close and can influence roughly 
the same region in the graph, the boost is likely to be high. Indeed, for the special case where gg |0 = 1 
and k > |> 5 '_ 4 |, directly “copying” ^-seeds to be B-seeds will give the optimal boost. However, this in no way 
diminishes the value of this problem, as Theorem 1 shows the NP-hardness in the general case. 

Theorem 2. For CompInfMax, when gg |0 = I and k > |B^|, we can solve the problem optimally by 
setting Bg to be Sa U X, where X is an arbitrary set in V\ Sa with size k — |B^|, that is (TAiSA, Sa U X) = 
max7’cy,|T|=fc f^A{SA, T). 

The proof of this theorem relies on the possible world model, which is defined in the next section. Hence, 
we defer to the proof to the appendix. 

5 Properties of Com-IC 

Since neither of SelfInfMax and CompInfMax can be solved in PTIME unless P = NP, we explore 
approximation algorithms by studying submodularity and monotonity for Com-IC, which may pave the way 
for designing approximation algorithms. Note that a a is a bi-set function taking arguments Sa and Bg, so we 
analyze the properties w.r.t. each of the two arguments. As appropriate, we refer to the properties of a a w.r.t. 
Sa (Bg) as self-monotonicity (resp., cross-monotonicity) and self-submodularity (resp., cross-submodularity). 


5.1 An Equivalent Possible World Model 

To facilitate a better understanding of Com-IC and our analysis on submodularity, we define a Possible 
World (PW) model that provides an equivalent view of the Com-IC model. Given a graph G = {V, E,p) and 
a diffusion model, a possible world consists of a deterministie graph sampled from a probability distribution 
over all subgraphs of G. For Com-IC, we also need some variables for each node to fix the outcomes of random 
events in relation to the NLA (adoption, tie-breaking, and reconsideration), so that influence cascade is fully 
deterministic in a single possible world. 

Generative Rules. Retain each edge (u,v) £ E w.p. pu,v {live edge) and drop it w.p. 1 — Pu,v {blocked 
edge). This generates a possible world W with Gw = (YtEw), Ew being the set of live edges. Next, for 
every node v G V, we 

1 . choose “thresholds” and independently and uniformly at random from [ 0 , 1 ], for comparison 
with GAPs in adoption decisions (when W is clear from context, we write and Og); 

2. generate a random permutation of all in-neighbors u £ N~{v) (for tie-breaking); 

3. sample a discrete value £ {A, B}, where each value has a probability of 0.5 (used for tie-breaking 
in case u is a seed of both A and B). 

Deterministic cascade in a PW. At time step 0, nodes in and Sb first become A-adopted and B- 
adopted, respectively (ties, if any, are broken based on r„). Then, iteratively for each step t > 1, a node v 
becomes “reachable” by A at time step t if t is the length of a shortest path from any seed u £ to v 

consisting entirely of live edges and A-adopted nodes. Node v then becomes A-adopted at step t if < x, 
where x = ( 7 _ 4|0 if v is not S-adopted, otherwise x = qy^\B- For re-consideration, suppose v just becomes B- 
adopted at step t, while being A-suspended (i.e., v was reachable by A before t steps but > qA\$)- Then, 

V adopts A if < qA\B- The reachability and reconsideration tests of B are symmetric. For tie-breaking, if 

V is reached by both A and B at t, the permutation 7 r„ is used to determine the order in which A and B are 
considered. In addition, if v is reached by A and B from the same in-neighbor, e.g., u, then the informing 
order follows the order in which u adopts A and B. 

The following lemma establishes the equivalence between this possible world model and Com-IC. This 
allows us to analyze monotonicity and submodularity using the PW model only. 

Lemma 1. For any fixed A-seed set Sa o,nd B-seed set Sb, the joint distributions of the sets of A-adopted 
nodes and B-adopted nodes obtained {i) by running a Com-IC diffusion from Sa o,nd Sb and {ii) by randomly 
sampling a possible world W and running a deterministic cascade from Sa and Sb in W, are the same. 

Equivalence Classes of Possible Worlds. Notice that in a possible world, and Og are both real 
values in the interval [0,1]. Thus in theory, the total number of possible worlds is infinite. In what follows, 
we establish a finite number of equivalence classes of possible worlds. This facilitates theoretical analysis of 
the model, e.g., the proof of Theorem 2 makes use of this property (see appendix). 

From the perspective of influence propagation dynamics under Com-IC, especially the final states of each 
node w.r.t. both products, the exact values of a\ and Og do not matter. Instead, we only need to know the 
ranges that a\ and Og fall in. For 0 ) 4 , the ranges are 

• [0,qA\$), [qA\$,qA\B), [^.AIbA] when qA\$ < qA\B, 

• [0,qA\B), [qA\B,qA\$), [ 9 ^ 10 , 1 ] when qA\B < 9^|0- 
Likewise for Og, they are: 

• [O,9b|0), [9b|0,9b|^), [9b|^,1] when qB\$ < qB\A, 

• [0,9b|^), [9b|.4,9b|0), [9b|0,1] when qB\A < 9b|0- 
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Given two possible worlds Wi and W 2 , we say that are equivalent iff each node v G V satisfies all of the 
following conditions: 

1 . and fall into the same range; 

2 . and also fall into the same range; 

3. (same order of in-neighbours in both worlds); 

4 ^Wl ^ _W2 

-t. ! y I y 

Clearly, for any two possible worlds Wi and W 2 in the same equivalence class, by model definition we 
have 

and = c7^^(Sa,Sb). 

for any ^-seed set Sa and S-seed set S'g. 

In view of the above, for any equivalence class W of possible worlds, by cr^{S a, Sb), without any 
ambiguity we mean {Sa, Sb), where W G W is any possible world in the equivalence class W. It is 
straightforward to verify that the total number of equivalence classes is finite, as opposed to the total 
number of possible worlds, which is uncountable. 

Let Pr[W] denote the total probability mass of all the possible worlds that belong to W. In principle, this 
probability can be computed using integration over a^^’s and ag’s, as there are still an uncountable number 
of possible worlds in any given equivalence class. An alternative method that does not involve integration is 
to look at the ranges in which a^^’s and ag’s fall. For example, Pr[g ^|0 < a)^ < Qaib] = Qa\B~Qa\$ (assuming 
complementarity). This is correct as the a-values are sampled uniformly at random from the interval [0,1]. 
We can then express the expected spread function using a linear combination of all equivalence classes: 

( 2 ) 

w 

where {Saj Sb) can be computed deterministically as we mentioned earlier in this subsection. 

5.2 Monotonicity 

It turns out that when A competes with B while B complements A, monotonicity does not hold in general 
(see Appendix for counter-examples). But note that these cases are rather unnatural. Hence, we focus on 
mutual competition (Q~) and mutural complementary cases (Q^), for which we can show self- and cross¬ 
monotonicity do hold. 

Theorem 3. For any fixed B-seed set Sb, ^^a^Sa, Sb) is monotonically increasing in Sa for any set of GAPs 
in Q’*' and Q~. Also, (Ja{Sa, Sb) is monotonically increasing in Sb for any GAPs in Q+, and monotonically 
decreasing in Sb for any Q~. 

5.3 Submodularity in Complementary Setting 

Next, we analyze self-submodularity and cross-submodularity for mutual complementary cases (Q^) that 
has direct impact on SelfInfMax and CompInfMax. The analysis for Q” is deferred to the appendix. 

For self-submodularity, we show that it is satisfied in the case of “one-way complementarity”, i.e., B 
complements A (g_ 4|0 < 9 a|b)) but A does not affect B ( 9 g |0 = qB\A)^ ''^ise versa (Theorem 4). We will 
also show the cr _4 is cross-submodular in S'g when qB\A — 1 (Theorem 5). However, both properties are not 
satisfied in general (see appendix for counter-examples). We give two useful lemmas first, and thanks to 
Lemma 2 below, we may assume w.l.o.g. that tie-breaking always favors A in complementary cases. 

Lemma 2. Gonsider any Gom-IG instance with Given fixed A- and B-seed sets, for all nodes v G V, 
all permutations of v’s in-neighbors are equivalent in determining if v becomes A-adopted and B-adopted, 
and thus the tie-breaking rule is not needed for mutual complementary case. 
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Lemma 3. In the Com-IC model, if B is indifferent to A (i.e., qB\A — then for any fixed B seed set 

Sb, the probability distribution over sets of B-adopted nodes is independent of A-seed set. Symmetrically, the 
probability distribution over sets of A-adopted nodes is also independent of B-seed set if A is indifferent to 

B. 

Theorem 4. For any instance of Com-IC model with < qA\B ^.nd qB\$ = qB\A; (*)■ '^A is self-submodular 
w.r.t. A seed set Sa, for any fixed B-seed set Sb- (**)• erg is self-submodular w.r.t. B seed set Sb and is 
independent of A-seed set Sa- 

Proof- First, {ii) holds trivially. By Lemma 3, A does not affect B’s diffusion in any sense. Thus, aB{SA, Sb) = 
(Tb{^, Sb)- It can be shown that the function (Tb( 0 , Sb) is both monotone and submodular w.r.t. Sb-, for any 
( 7 b| 0 , through a straightforward extension to the proof of Theorem 2.2 in Kempe et al. [15]. 

For (i), first we fix a possible world W and a K-seed set Sb- Let (Sa) be the set of ^-adopted nodes 
in possible world W with ^-seed set Sa {Sb omitted when it is clear from the context). Consider two sets 
S C T C V, some node u € V\T, and finally a node v € (TU {u}) \ (T). There must exist a live-edge 

path Pa from TU {n} consisting entirely of yl-adopted nodes. We denote by wq S TU {u} the origin of Pa- 
We first prove a key claim: Pa remains ^-adopted when Sa = { wq }. Consider any node Wi G Pa- In this 
possible world, if afff < < 7 ^ 10 , then regardless of the diffusion of B, Wi will adopt A as long as its predecessor 
Wi-i adopts A- If qA\$ < < qA\Bi then there must also be a live-edge path Pg from Sb to Wi that 

consists entirely of P-adopted nodes, and it boosts Wi to adopt A- Since qB\$ = qB\A^ has no effect on 
P-propagation (Lemma 3), and Pg always exists and all nodes on Pg would still be P-adopted through S'g 
(fixed) irrespective of ^-seeds. Thus, Pg always boosts Wi to adopt A as long as Wi-i is ^-adopted. Hence, 
the claim holds by a simple induction on Pa starting from wq- 

Then, it is easy to see wq = u- Suppose otherwise, then wq G T must be true. By the claim above and 
self-monotonicity of cr^ (Theorem 3), v G $)^({ri;o}) implies w G $)^(P), a contradiction. Therefore, we 
have V ^ {S) and v G (S'U {u}). This by definition implies (•)[ is submodular for any W and Sg, 

which is sufficient to show that (T^(S^, Sg) is submodular in Sa- D 

Theorem 5. In any instance of Com-IC with mutual complementarity Q^, (Ja is cross-submodular w-r.t. 
B-seed set Sb, for any fixed A-seed set, as long as qB\A = 1- 

Proof We first fix an „4-seed set S_ 4 . Consider any possible world W- Let 'f'5^(Sg) be the set of ^-adopted 
nodes in W with B seed-set Sg (and ^-seed set S^). Consider P-seed sets S C T C V and another P-seed 
n S C \ T. It suffices to show that for any v G 4')^ (T U {u}) \ 4')^(T), we have v G 4^)^(S U {«}) \ 4^)^(S). 

Let an A-path be a live-edge path from some ^-seed such that all nodes on the path adopt A, and P-path 
is defined symmetrically. If a node w has < 9 ^ 10 , we say that w is A-ready, meaning that w is ready for 
A and will adopt A if it is informed of A, regardless of its status on B- We say a path from S _4 is an A-ready 
path if all nodes on the path (except the starting ^-seed) are .A-ready. It is clear that all nodes on an .A-ready 
path would always adopt A regardless of P-seeds. We define ;B-ready nodes and paths symmetrically. We 
can show the following claim. 

Claim 3. On any A-path Pa, if some node w adopts B and all nodes before w on Pa are A-ready, then 
every node following w on Pa adopts both A and B, regardless of the actual B-seed set- 

Now consider the case of ^g = T U {n} first. Since v € 4')^(T U {u}), there must be an A-path Pa from 
some node Wq G Sa to v- If path P _4 is .A-ready, then regardless of B seeds, all nodes on P _4 would always be 
.A-adopted, but this contradicts the assumption that v ^ 4>)^(r). Therefore, there exists some node w that 
is not .A-ready, i.e., < 7,410 < aff < < 7 , 4 |g. Let w be the first non-.A-ready node on path P, 4 . Then w must have 
adopted B to help it adopt A, and ofg < qB\$- We can show the following key claim. 

Claim 4. There is a B-path Pb from some B-seed xq G TuIm} to w, such that even if xq is the only B-seed, 
w still adopts B- 
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With the key Claim 4, the rest of the proof follows the standard argument as in the other proofs. In 
particular, since even when xq is the only ;B-seed, w can still be ,B-adopted, then by Claim 3, v would be 
y4-adopted in this case. Thus we know that xq must be u, because otherwise it contradicts our assumption 
that V ^ 4'^(T) (also relying on the cross-monotonicity proof made for Theorem 3). Then again by the 
cross-monotonicity, we know that v G 4^^(S U {u}), but v ^ (S). This completes our proof. □ 


6 Approximation Algorithms 


We first review the state-of-the-art in influence maximization and then derive a general framework (§6.1) to 
obtain approximation algorithms for SelfInfMax (§6.2) and CompInfMax (§6.3). 

TIM algorithm. For influence maximization, Tang et al. [24] proposed the Two-phase Influence Maximiza¬ 
tion (TIM) algorithm that produces a (1 — 1/e — e)-approximation with at least 1 — probability in 

0{{k -\- I){\E\ -I- |C|) log |y|/e^) expected running time. It is based on the concept of Reverse-Reachable sets 
(RR-sets) [2], and applies to the Triggering model [15] that generalizes both IC and LT. TIM is orders of 
magnitude faster than greedy algorithm with Monte Carlo simulations [15], while still giving approximation 
solutions with high probability. Recently they propose a new improvement [23], which significantly reduces 
the number of RR-sets generated using martingale analysis. To tackle SelfInfMax and CompInfMax, 
we primarily focus on the challenging task of correctly generating RR-sets in Com-IC and other more gen¬ 
eral models, which is orthogonal to the contributions of [23] . Hereafter we focus on the framework of [24] . 
Due to the much more complex dynamics involved in Com-IC, adapting TIM to solve SelfInfMax and 
CompInfMax is far from trivial, as we shall show. 

Reverse-Reachable Sets. In a deterministic (directed) graph G' = {V,E'), for a fixed v G V, all nodes 
that can reach v form an RR-set rooted at v [2], denoted R{v). A random RR set encapsulates two levels 
of randomness: (i) a “root” node v is randomly chosen from the graph, and (ii) a deterministic graph is 
sampled according to a certain probabilistic rule that retains a subset of edges from the graph. E.g., for the 
IC model, each edge {u,v) G E is removed w.p. (1 —pu,v), independently. TIM first computes a lower bound 
on the optimal solution value and uses this bound to derive the number of random RR-sets to be sampled, 
denoted 9. To guarantee approximation solutions, 9 must satisfy: 


9 > e-^{8 + 2e)\V\ ■ 


Hog\V\ -\- log -f log 2 


(3) 


where OPTk is the optimal influence spread achievable amongst all size-fc sets, and e represents the trade-off 
between efficiency and quality: a smaller e implies more RR-sets (longer running time), but gives a better 
approximation factor. The approximation guarantee of TIM relies on a key result from [2], re-stated here: 

Proposition 1 (Lemma 9 in [24]). Fix a set S CV and a node v G V. Under the Triggering model, let pi 
be the probability that S activates v in a cascade, and p 2 be the probability that S overlaps with a random 
RR-set R{v) rooted at v. Then, pi = P 2 - 


6.1 A General Solution Framework 

We use Possible World (PW) models to generalize the theory in [2,24]. For a generic stochastic diffusion 
model M, an equivalent PW model M' is a model that specifies a distribution over W, the set of all possible 
worlds, where influence diffusion in each possible world in W is deterministic. Further, given a seed set (or 
two seed sets Sa and as in Com-IC), the distribution of the sets of active nodes (or A- and ;B-adopted 
nodes in Com-IC) in M is the same as the corresponding distribution in M'. Then, we define a generalized 
concept of RR-set through the PW model: 

Definition 1 (General RR-Set). For each possible world W G W and a given node v (a.k.a. root), the 
reverse reachable set (RR-set) of v in W, denoted by R\^(y), consists of all nodes u such that the singleton 
set {u} would activate v in W. A random RR-set of v is a set R\y(v) where W is randomly sampled from 
W using the probability distribution given in M'. 
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It is easy to see that Definition 1 encompasses the RR-set definition in [2,24] for IC, LT, and Triggering 
models as special cases. For the entire solution framework to work, the key property that RR-sets need to 
satisfy is the following: 

Definition 2 (Activation Equivalence Property). Let M be a stoehastie diffusion model and M' be its 
equivalent possible world model. Let G = (V, E,p) be a graph. Then, RR-sets have the Activation Equivalence 
Property if for any fixed S CV and any fixed v € V, the probability that S activates v according to M is the 
same as the probability that S overlaps with a random RR-set generated from v in a possible world in M'. 

As shown in [24], the entire correctness and complexity analysis is based on the above property, and in 
fact in their latest improvement ]23], they directly use this property as the definiton of general RR-sets. 
Proposition 1 shows that the activation equivalence property holds for the triggering model. We now provide 
a more general sufficient condition for the activation equivalence property to hold (Lemma 5), which gives 
concrete conditions on when the RR-set based framework would work. More specifically, we show that for any 
diffusion model M, if there is an equivalent PW model M' of which all possible worlds satisfy the following 
two properties, then RR-sets have the activation equivalence property. 

Possible World Properties 

• (PI): Given two seed sets S' C T, if a node v can be activated by S in a possible world W, then v shall 
also be activated by T in W. 

• (P2): If a node v can be activated by S in a possible world W, then there exists u G S such that the 
singleton seed set {u} can also activate v in W. In fact, (PI) and (P2) are equivalent to monotonicity 
and submodularity, as we formally state below. 

Lemma 4. Let W be a fixed possible world. Let fv,w{S) be an indicator function that takes on 1 if S can 
activate v in W, and 0 otherwise. Then, fv,w{') monotone and submodular for all v G V if and only if 
both (PI) and (P2) are satisfied in W. 

Lemma 5. Let M be a stochastic diffusion model and M' be its equivalent possible world model. Lf M' satisfies 
Properties (PI) and (P2), then the RR-sets as defined in Definition 1 have the activation equivalence property 
as in Definition 2. 

Comparing with directly using the activation equivalence property as the RR-set definition in [23], our 
RR-set definition provides a more concrete way of constructing RR-sets, and our Lemmas 4 and 5 provide gen¬ 
eral conditions under which such constructions can ensure algorithm correctness. Algorithm 1, GeneraITIM, 
outlines a general solution framework based on RR-sets and TIM. It provides a probabilistic approximation 
guarantee for any diffusion models that satisfy (PI) and (P2). Note that the estimation of a lower bound 
LB of OPTk (line 1) is orthogonal to our contributions and we refer the reader to [24] for details. Finally, 
we have: 

Theorem 6. Suppose for a stochastic diffusion model M with an equivalent PW model M', that for every 
possible world W and every v G V, the indicator function fv,w is monotone & submodular. Then for influence 
maximization under M with graph G = (V, E,p) and seed set size k, GeneralTIM (Algorithm 1 ) applied on the 
general RR-sets (Definition 1 ) returns a (1 — 1/e —e)-approximate solution with at least 1 — probability. 

Theorem 6 follows from Lemmas 4 and 5, and the fact that all theoretical analysis of TIM relies only on 
the Chernoff bound and the activation equivalence property, “without relying on any other results specific to 
the IC model” [24]. Next, we describe how to generate RR-sets correctly for SelfInfMax and CompInfMax 
under Com-IC (line 3 of Algorithm 1), which is much more complicated than IC/LT models ]24]. We will 
first focus on submodular settings for SelfInfMax (Theorem 4) and CompInfMax (Theorem 5). In §6.4, 
we propose Sandwich Approximation to handle general where submodularity does not hold. 
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Algorithm 1: GeneralTIM (G = {V,E,p), k, e, i) 

1 LB ^ lower bound of OPT^ estimated by method in [24] 

2 compute 0 using Eq. (3) with LB replacing OPT^ 

3 TZ <— generate 9 random RR-sets according to Definition 1 //for SelfInfMax, use rr-Sim or rr-Sim+; for 

CompInfMax, use RR-CIM 

4 for i = 1 to k do 

5 Vi ^ the node appearing in the most RR-sets in TZ 

6 S i — S U // S was initialized as 0 

7 remove all RR-sets in which Vi appears 

8 return S as the seed set 


6.2 Generating RR-Sets for SelfInfMax 

We present two algorithms, RR-SIM and RR-SIM+, for generating random RR-sets per Definition 1. The 
overall algorithm for SelfInfMax can be obtained by plugging RR-SIM or RR-SIM+ into GeneralTIM (Al¬ 
gorithm 1). 

According to Definition 1, for SelfInfMax, the RR-set of a root u in a possible world W, Rw{v), is 
the set of nodes u such that if u is the only A-seed, v would be A-adopted in W, given any fixed S-seed set 
S'g. By Theorems 3 and 4 (whose proofs indeed show that the indicator function fv,w{S) is monotone and 
submodular), along with Lemmas 4 and 5, we know that RR-sets following Definition 1 have the activation 
equivalence property. We now focus on how to construct RR-sets following Definition 1. Recall that in Com- 
IC, adoption decisions for A are based on a number of factors such as whether v is reachable via a live-edge 
path from S'_4 and its state w.r.t. B when reached by A. Note that 5g|0 = implies that ;B-difFusion is 
independent of A (Lemma 3). Our algorithms take advantage of this fact, by first revealing node states w.r.t. 
B, which gives a sound basis for generating RR-sets for A. 

6.2.1 The RR-SIM Algorithm 

Conceptually, RR-SIM (Algorithm 2) proceeds in three phases. Phase I samples a possible world according 
to §5.1 (omitted from the pseudo-code). Phase II is a forward labeling process from the input .B-seed set Sb 
(lines 2 to 7): a node v becomes B-adopted if < gg|0 and v is reachable from Sb via a path consisting 
entirely of live edges and B-adopted nodes. In Phase III (lines 8 to 15), we randomly select a node v and 
generate RR-set Rw{v) by running a Breadth-First Search (BPS) backwards (following incoming edges). 
Note that the RR-set generation for IC and LT models [24] is essentially a simpler version of Phase III. 
Backward BFS. Given W, an RR-set Rw{v) includes all nodes explored in the following backward BPS 
procedure. Initially, we enqueue v into a FIFO queue Q. We repeatedly dequeue a node u from Q for 
processing until the queue is empty. 

• Case 1: u is B-adopted. There are two sub-cases: (f). If a'^ < q_A\B, then u is able to transit from A- 

informed to A-adopted. Thus, we continue to examine u’s in-neighbors. For all unexplored w S N~(u), 
if edge {w,u) is live, then enqueue w; (ii). If a'^ > then u cannot transit from A-informed to 

A-adopted, and thus u has to be an A seed to become A-adopted. In this case, u’s in-neighbors will 
not be examined. 

• Case 2: u is not B-adopted. Similarly, if a\ < perform actions as in 1(f); otherwise perform 

actions as in l{ii). 

Theorem 7. Under one-way complementarity (q'_ 4|0 < qA\B 9b|0 = 9b\a)! RR-sets generated by 
the RR-SIM algorithm satisfy Definition 1 for the SelfInfMax problem. As a result, Theorem 6 applies to 
GeneralTIM with RR-SIM in this case. 
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Algorithm 2: RR-SIM {G = {V,E), v, Sb) 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 


create an empty FIFO queue Q and empty set R 

enqueue all nodes in Sb into Q // start forward labeling 

while Q is not empty do 

u ^ Q.dequeueQ and mark u as B-adopted 
foreach v € N^{u) such that (u,u) is live do 
if A u is not visited then 

I (5.enqueue( // also mark v as visited 
clear Q, and then enqueue v // start backward bfs 
while Q is not empty do 
u -ir- Q-dequeueO 
R ^ RU {u} 

if {u is S-adopted A < Qaib) V {u is not ;B-adopted A < <7^10) then 

foreach w G N~{u) such that {w,u) is live do 
if w is not visited then 

(5.enqueue( rc) // also mark w visited 

return R as the RR-set 


Lazy sampling. For RR-SIM to work, it is not necessary to sample all edge- and node-level variables (i.e., 
the entire possible world) up front, as the forward labeling and backward BFS are unlikely to reach the 
whole graph. Hence, we can simply reveal edge and node states on demand (“lazy sampling”), based on the 
principle of deferred decisions. In light of this observation, the following improvements are made to RR-SIM. 
First, the first phase is simply skipped. Second, in Phase II, edge states and a-values are sampled as the 
forward labeling from Ss goes on. We record the outcomes, as it is possible to encounter certain edges and 
nodes again in phase (III). Next, for Phase III, consider any node u dequeued from Q. We need to perform 
an additional check on every incoming edge {w, u). If (w, u) has already been tested live in Phase II, then we 
just enqueue w. Otherwise, we first sample its live/blocked status, and enqueue w if it is live. Algorithm 2 
provides the pseudo-code for RR-SIM, where sampling is assumed to be done whenever we need to check the 
status of an edge or the a-values of a node. 

Expected time complexity. For the entire seed selection (Algorithm 1 with RR-SIM) to guarantee approx¬ 
imate solutions, we must estimate a lower bound LB of OPT^ and use it to derive the minimum number 
of RR-sets required, defined as 6 in Eq. (3). In expectation, the algorithm runs in 0{9 ■ EPT) time, where 
EPT is the expected number of edges explored in generating one RR-set. Clearly, EPT — EPTp + EPTb, 
where EPTp {EPTp) is the expected number of edges examined in forward labeling (resp., backward BFS). 
Thus, we have the following result. 

Lemma 6. In expectation, GeneralTIM with RR-SIM runs in O {{k + f')(|P| -|- \E\) log \V\ (1 -I- EPTp/EPTp)) 
time. 

EPTp increases when the input S-seed set grows. Intuitively, it is reasonable that a larger B-seed set 
may have more complementary effect and thus it may take longer time to find the best A-seed set. However, 
it is possible to reduce EPTp as described below. 

6.2.2 The RR-SIM-R Algorithm 

The RR-SIM algorithm may incur efficiency loss because some of the work done in forward labeling (Phase H) 
may not be used in backward BFS (Phase HI). E.g., consider an extreme situation where all nodes explored 
in forward labeling are in a different connected component of the graph than the root v of the RR-set. In this 
case, forward labeling can be skipped safely and entirely! To leverage this, we propose RR-SIM+ (pseudo-code 
presented as Algorithm 3), of which the key idea is to run two rounds of backward BFS from the random 
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Algorithm 3: RR-SIM+ (G = {V,E), v, S'g) 


1 create an FIFO queue Q and empty sets R, Ti 

2 (5.enqueue('c) j // first backward BFS 

3 while Q is not empty do 

4 u ^ Q-dequeueO 

5 Ti i — Ti U {u} 

6 foreach unvisited w G N~ (u) such that {w, u) is live do 

7 I Q.enqueue(w) and mark w visited 

8 if Ti n 5*0 0 then 

// auxiliary forward pass to determine B adoption 


clear Q, enqueue all nodes of Ti H Sb into Q, and execute line 3 to line 7 in Algorithm 2 
10 execute line 8 to line 16 in Algorithm 2 // second backward BFS 


root V. The first round determines the necessary scope of forward labeling, while the second one generates 
the RR-set. 

First backward BFS. As usual, we create a FIFO queue Q and enqueue the random root v. We also 
sample Og uniformly at random from [0,1]. Then we repeatedly dequeue a node u until Q is empty: for each 
incoming edge {w,u), we test its live/blocked status based on probability Pw,u-, independently. If {w,u) is 
live and w has not been visited before, enqueue w and sample its Og. 

Let Ti be the set of all nodes explored. If Ti D Sb = 0, then none of the S-seeds can reach the explored 
nodes, so that forward labeling can be completely skipped. The above extreme example falls into this case. 
Otherwise, we run a residual forward labeling only from Ti H Sb along the explored nodes in Ti: if a node 
u G Ti\Sb is reachable by some s G TiD Sb via a live-edge path with all ;B-adopted nodes, and Og< qB\ii, 
u becomes ;B-adopted. Note that it is not guaranteed in theory that this always saves time compared to 
RR-SIM, since the worst case of RR-SIM+ is that TiH Sb = Sb, which means that the first round is wasted. 
However, our experimental results §7 indeed show that RR-SIM+ is at least twice faster than RR-SIM on 
three of the four datasets. 

Second backward BFS. This round is largely the same as Phase III in RR-SIM, but there is a subtle 
difference. Suppose we just dequeued a node u. It is possible that there exists an incoming edge {w, u) whose 
status is not determined. This is because we do not enqueue previously visited nodes in BFS. Hence, if in 
the previous round, w is already visited via an out-neighbor other than u, {w, u) would not be tested. Thus, 
in the current round we shall test {w,u), and decide if w belongs to R\y{v) accordingly. To see RR-SIM+ 
is equivalent to RR-SIM, it suffices to show that for each node explored in the second backward BFS, its 
adoption status w.r.t. B is the same in both algorithms. 

Lemma 7. Consider any possible world W under the Com-IC model. Let v be a root for generating an 
RR-set. For any u G V that is backward reachable from v via live-edges in W, u is determined as B-adopted 
in RR-SIM if and only if u is determined as B-adopted in RR-SIM+. 

Proof. We first prove the “if” part. Suppose u is determined as ;B-adopted in RR-SIM+. This means that 
there exists a node s G Ti H Sb, such that there is a path from s to u consisting entirely of live-edges and 
;B-adopted nodes (every node w on this path satisfies that ag < ^ 010 ). Therefore, in RR-SIM, where W is 
generated upfront, this live-edge path must still exist. Thus, u must be also ,B-adopted in RR-SIM as well. 

Next we prove the “only if” part. By definition, if u is determined as K-adopted in possible world W, 
then there exists a path P from some s G Sb to u such that the path consists entirely of live edges and all 
nodes w on the path satisfy that < qB\$- It suffices to show that if u is reachable by v backwards in W, 
then P will be explored entirely by RR-SIM+. 

Suppose otherwise. That is, there exists a node z G P, such that z cannot be explored by the first round 
backward BFS from v. We have established that in the complete possible world W, there is a live-edge path 
from z to u and from u to v respectively. Thus, connecting the two paths at node u gives a single live-edge 
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Algorithm 4: RR-CIM (G = {V,E), v, Sa) 


1 conduct forward labeling on G from Sa, cf. Eq. (4) 

2 if i; is neither A-suspended or A-potential then 

3 I return 0 as the RR-set 

4 (3.enqueue( // Q initialized as an empty FIFO queue 


5 while Q is not empty do 


6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 
19 


u Q-dequeueO 
if u is ^-suspended then 
if u is ^^-diffusible then 

R i — R U {w} // R was initialized as 0 

conduct a secondary backward BPS from u via B diffusible nodes, and add all explored 
nodes to R 
else i? ^ i? U {u} 
else if u is A-potential then 
if u is AK-diffusible then 

foreach unvisited w G N~{u) s.t. {w,u) live do 
Q.enqueue( w')'^ // also mark it visited 

else 

Sf G- nodes visited in a secondary forward BPS 
Sb G- nodes visited in a secondary backward BPS 
R ^ RU {m} if Sf n Sb contains an A-suspended node uq 


20 return R as the RR-set 


path Pz from z to v. Now recall that the continuation of the first backward BPS phase in RR-SIM+ relies 
solely on edge status (as long as an edge {w, u) is determined live, w will be visited by the BPS). This means 
that z must have been explored in the first backward BPS allow the backward path from v to u and then 
along the path P, which is a contradiction. □ 

The analysis on expected time complexity is similar: We can show that the expected running time of RR- 
SIM+ is O {{k + ^)(|R| -I- |if|) log |R| (1 -I- EPTbi/EPT 32 )), where EPTbi {EPT 32 ) is the expected number 
of edges explored in the first (resp., second) backward BPS. Compared to RR-SIM, EPT 32 is the same as 
EPT 3 in RR-SIM, so RR-SIM+ will be faster than RR-SIM if EPT 31 < EPTp, i-e., if the first backward 
BPS plus the residual forward labeling explores fewer edges, compared to the full orward labeling in RR-SIM. 

6.3 Generating RR-Sets for CompInfMax 

In CompInfMax, by Definition 1, a node u belongs to an RR-set Rw{v) iff v is not A-adopted without 
any S-seed, but turns A-adopted when u is the only B-seed. It turns out that constructing RR-sets for 
CompInfMax following the above definition is significantly more difficult than that for SelfInfMax. This 
is because when qA\</i < Qa\B gB |0 < qB\A = f, A and B complement each other, and thus a simple 
forward labeling from the fixed A-seed set, without knowing anything about B, will not be able to determine 
the A adoption status of all nodes. This is in contrast to SelfInfMax with one-way complementarity for 
which ,B-diffusion is fully independent of A. Thus, when generating RR-sets for CompInfMax, we have to 
determine more complicated status in a forward labeling process from A-seeds, as shown below. 

Phase I: forward labeling. The nature of CompInfMax requires us to identify nodes with the potential 
to be A-adopted with the help of B. To this end, we first conduct a forward search from Sa to label the nodes 
their status of A. As in RR-SIM, we also employ lazy sampling. The algorithm first enqueues all A-seeds 
(and labels them A-adopted) into a FIFO queue Q. Then we repeatedly dequeue a node u for processing 
until Q is empty. Let v be an out-neighbor of u. Flip a coin with bias pu,v to determine if edge (u, v) is live. 
If yes, we determine the label of v to be one of the following: 
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( 4 ) 


^-adopted, 

.4-rejected, 

^-suspended, 

vA-potential, 


if u is y^-adopted A a\ < q ^|0 
if a\ > qA\Bi regardless of u’s status 
if u is y^-adopted A S {qA\$iqA\B\ 
if u is ^-suspended/potential A < qais 


Here, ^-potential is just a label used for bookkeeping and is not a state. Then node v is added to Q 
unless it is ^-rejected. Note that both ^-suspended and ^-potential nodes can turn into ^-adopted with 
the complementary effect of B. The main difference is an ^-suspended node is informed of A, while an 
^-potential is not and the informing action must be triggered by H-propagation. Also, unlike a typical BFS, 
the forward labeling may need to revisit a node: if u is A-adopted (just dequeued) and v is previously labeled 
A-potential, v should be “promoted” to A-suspended. This occurs when v is first reached by a live-edge path 
through an A-suspended/potential in-neighbor, but later v is reached by a longer path through an A-adopted 
in-neighbor. 

To facilitate the second phase, we define additional node labels AB-diffusible and B-diffusible. Node v is 
A;B-diffusible if v can adopt both A and B when v is informed about both A and B\ while v is H-diffusible if 
V can adopt B when it is informed about B. Accordingly, the technical conditions for them are given below: 


I AH-diffusible, if V ((g ^|0 < 0)4 < Qaib) A (a^ < 9010 )) 

1 H-diffusible, if ag < gB|0V v is A-adopted as labeled in Eq.(4) 

Note that these diffusible labels are only based on a node’s local state, and they are not limited to the 
nodes explored in the first phase — some nodes may only be explored in the second phase and they also 
need to be checked for these diffusible labels. 

Phase II: RR-set generation. The second phase features a primary backward search from a random root v. 
Also, a number of secondary searches (from certain nodes explored in the primary search) may be necessary 
to find all nodes qualified for the RR-set. Intuitively, the primary backward search is to locate A-suspended 
nodes u via AH-diffusible and A-potential nodes, since once such a node u adopts B, it will adopt A and 
then through those AS-diffusible and A-potential nodes, the root v will adopt A and B. Thus such a node 
u can be put into the RR-set of v. In addition, if such node u is also AH-diffusible, then any B seed w that 
can activate u to adopt B via ;B-diffusible nodes can also be put into the RR-set of v, and we find such nodes 
w using a secondary backward search from u via ;B-diffusible nodes. However, some additional complication 
may arise during the search process, and we cover all cases in Algorithm 4 and explain them below. 

We first sample a root v randomly from V. In case v is labeled A-adopted or A-rejected, we simply return 
i? as 0 because no ;B-seed set can change u’s adoption status of A (lines 2 to 3). The primary search then 
starts. It first enqueues v into a FIFO queue Q. Now consider a node u dequeued from Q. Four cases arise. 


• Case 1: u is A-suspended and AB-diffusible (lines 8 to 10). We add u to R. Moreover, any node w 
that can propagate H to u by itself should also be added to R. To find all such w’s, we launch a 
secondary backward BFS from u via H-diffusible nodes. In particular, we conduct a reverse BFS from 
u to explore all nodes that could reach u via H-diffusible nodes, and put all of them into set R. If this 
secondary search touches a node w that is not H-diffusible, we put w in R but do not further explore 
the in-neighbors of w. 

• Case 2: u is A-suspended but not AB-diffusible (line 11). Add u to R, but do not initiate a secondary 
search, because u cannot adopt A or B even if it is informed of both A and B, and thus the only way 
to make it adopt B is to make it a B seed. 

• Case 3: u is A-potential and AB-diffusible (lines 13 to 15). We enqueue all w € N~ (u) such that [w, u) 
is live in W and continue without adding u to R, as u cannot even be informed of A by Sa alone, and 
hence cannot propagate A to u by itself. 
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Figure 3: Case 4 of primary backward search in RR-CIM 


• Case 4- u is A-potential hut not AB-diffusible (lines 16 to 19). This is the most complicated case that 
needs a special treatment. In general, we should stop the primary backward search at u and try other 
branches, because u is not yet ^-informed and u cannot help in diffusing A and B even when informed 
of A and B. However, there is a special case in which we can still put u in i? (making u a H-seed): u 
can reach an yl-suspended and ^;B-diffusible node uq via a H-diffusible path such that u can activate 
uq in adopting B through this path, and then uq can reach back u via an ylH-diffusible path, such 
that uo can activate u in adopting A. E.g., consider Figure 3 (all edges are live): a is an yl-seed, u is 
yl-potential but not ^.B-diffusible, and uq is ylH-diffusible and yl-suspended. 

To identify such u, we start two secondary BPS from u, one traveling forwards, one backwards. The 
forward search explores all S-diffusible nodes reachable from u and puts them in a set Sf, and stops at a 
node w when w is not H-diffusible, but also puts w in set Sf. The backward search explores all ^H-diffusible 
and .A-potential/suspended/adopted nodes that can reach u and puts them in set Sh- If there is a node 
Uq G Sf n Sb that is yl-suspended, then we can put u into R. After this special treatment, we stop exploring 
the in-neighbors of u in the primary search and continue the primary search elsewhere. We have: 

Theorem 8. Suppose that qjf\$ < qA\B 9b|0 ^ 901^:1 = 1- The RR-sets generated by the RR-CIM algorithm 
satisfies Definition 1 for the CompInfMax problem. As a result, Theorem 6 applies to GeneralTIM with RR- 
CIM in this case. 

Expected Time Complexity. Both phases of RR-CIM require more computations compared to RR-SIM. 
First, the number of edges explored in Phase I, namely EPT p, is larger in RR-CIM, as the forward labeling 
here needs to continue beyond just A-adopted nodes. For Phase II, let EPT bs be the expected number of 
edges pointing to nodes in R and EPT bo be the expected number of all other edges examined in this phase 
(including both primary and secondary searches). Thus, we have: 

Lemma 8. In expectation, GeneralTIM with RR-CIM runs in O (^{k + ^)(|F| + \E\) log \V\ ^1 -I- 
time. 


6.4 The Sandwich Approximation Strategy 

We present the Sandwich Approximation (SA) strategy that leads to algorithms with data-dependent ap¬ 
proximation factors for SelfInfMax and CompInfMax in the general mutual complement case of Com-IC 
(9^10 ^ Qa\B gg|0 < qB\A) when submodularity may not hold. In fact, SA can be seen as a general 
strategy, applicable to any non-submodular maximization problems for which we can find submodular upper 
or lower bound functions. 

Let a : 2^ ^ IR>o be non-submodular. Let p and u be submodular and defined on the same ground 
set V such that p(S) < cr(S) < rz(S') for all S C V. That is, p (p) is a lower (resp., upper) bound on a 
everywhere. Consider the problem of maximizing tr subject to a cardinality constraint k. Notice that if the 
objective function were p or zz, the problem would be approximable within 1 — 1/e (e.g., max-Zc-cover) or 
1 — 1/e — e (e.g., influence maximization) by the greedy algorithm [15,20]. A natural question is: Can we 
leverage the fact that p and v “sandwich” a to derive an approximation algorithm for maximizing a? The 
answer is “yes”. 

Sandwich Approximation. First, run the greedy algorithm on all three functions. It produces an approx¬ 
imate solution for p and v. Let S^, So-, S^, be the solution obtained for p, a, and v respectively. Then, select 
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the final solution to tr to be 


Ssand = argmax (j{S). 


(5) 


Theorem 9. Sandwich Approximation solution gives: 

cr{Ssand) > max • (1 - 1/e) • a{S:), (6) 

where S* is the optimal solution maximizing a (subject to cardinality constraint k). 

Remarks. While the factor in Eq. (6) involves S*, generally not computable in polynomial time, the first 
term inside max{.,.}, involves 5"^ can be computed efficiently and can be of practical value (see Table 8 in 
§7). We emphasize that SA is much more general, not restricted to cardinality constraints. E.g., for a general 
matroid constraint, simply replace 1 — 1/e with 1/2 in (6), as the greedy algorithm is a 1/2-approximation 
in this case [20]. Furthermore, monotonicity is not important, as maximizing general submodular functions 
can be approximated within a factor of 1/2 [4], and thus SA applies regardless of monotonicity. On the other 
hand, the true effectiveness of SA depends on how close v and p, are to a: e.g., a constant function can be a 
trivial submodular upper bound function but would only yield trivial data-dependent approximation factors. 
Thus, an interesting question is how to derive v and p that are as close to a as possible while maintaining 
submodularity. 

Next, we apply SA to both SelfInfMax and CompInfMax in the general mutual complementarity 
case (Q^). 

SelfInfMax. GeneralTIM with RR-SIM or RR-SIM+ provides a (1 — 1/e — e)-approximate solution with 
high probability, when < g_ 4 |e and gg |0 = (1b\A- When function v (upper bound) can be 

obtained by increasing g0|0 to while p (lower bound) can be obtained by decreasing ge|_4 to qt 3 \%- 
CompInfMax. GeneralTIM with RR-CIM provides a (1 — 1/e — e)-approximate solution with high prob¬ 
ability, when (3'_4|0 < qjp^Q and <7010 < <701^ = 1. When g0|^ is not necessarily 1, we obtain an upper bound 
function by increasing <701^ to 1. 

Theorem 10 . Suppose ( 7^10 < ( 7^10 and <7010 < qB\A- Then, under the Com-IC model, for any fixed A and B 
seed sets Sa o^nd Sb, crA{SA, Sb) is monotonically increasing w.r.t. any one of {qA\$,qA\Bi<lB\$i(lB\A\ with 
other three GAPs fixed, as long as after the increase the parameters are still in Q'^'. 

Putting it all together, the final algorithm for SelfInfMax is GeneralTIM with RR-SIM/RR-SIM+ and 
SA. Similarly, the final algorithm for CompInfMax is Algorithm 1 with RR-CIM and SA. It is important 
to see how useful and effective SA is in practice. We address this question head on in §7, where we “stress 
test” the idea behind SA. Intuitively, if the GAPs are such that (7010 and qB\A are close, the upper and lower 
bounds (p and p) obtained for SelfInfMax can be expected to be quite close to a. Similarly, when qj3\A 
close to 1, the corresponding upper bound for CompInfMax should be quite close to a. We consider settings 
where (7010 and q' 0|_4 are separated apart and similarly (70|_4 is not close to 1 and measure the effectiveness of 
SA (see Table 8). 

7 Experiments 

We perform extensive experiments on three real-world social networks. We first present results with syn¬ 
thetic GAPs (§7.1); then we propose a method for learning GAPs using action log data (§7.2), and conduct 
experiments using learnt GAPs (§7.3). 

Datasets. Flixster is collected from a social movie site and we extract a strongly connected component. 
Douban is collected from a Ghinese social network [25], where users rate books, movies, music, etc. We crawl 
all movie & book ratings of the users in the graph, and derive two datasets from book and movie ratings: 
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Douban-Book Douban-Movie Flixster Last.fm 


^ nodes 

23.3K 

34.9K 

12.9K 

61K 

# edges 

141K 

274K 

192K 

584K 

avg. out-degree 

6.5 

7.9 

14.8 

9.6 

max. out-degree 

1690 

545 

189 

1073 


Table 1: Statistics of graph data (all directed) 


SelfInfMax 

VanillalC 

Copying 

9A|0 

0.1 

0.3 

0.5 

0.1 

0.3 

0.5 

Douban-Book 

5.89% 

0.93% 

0.50% 

85.7% 

207% 

301% 

Douban-Movie 

24.7% 

3.30% 

1.72% 

13.3% 

68.8% 

122% 

Flixster 

35.5% 

11.3% 

5.15% 

16.7% 

48.0% 

84.8% 

Last.fm 

31.5% 

2.75% 

0.70% 

22.6% 

88.5% 

168% 

CompInfMax 

VanillalC 

Copying 

9B|0 

0.1 

0.5 

0.8 

0.1 

0.5 

0.8 

Douban-Book 

13.4% 

31.2% 

25.6% 

49.9% 

32.9% 

30.7% 

Douban-Movie 

135% 

151% 

101% 

14.4% 

7.46% 

3.84% 

Flixster 

81.7% 

58.5% 

24.9% 

13.6% 

8.21% 

10.7% 

Last.fm 

140% 

110% 

48.3% 

10.6% 

9.12% 

6.77% 


Table 2: Percentage improvement of GeneralTIM over VanillalC Sz Copying, where the other seed 
set is chosen to be the 101st 200th from the VanillalC order 


Douban-Book and Douban-Movie. Last.fm is taken from the popular music website with social networking 
features. Table 1 presents basic stats of the datasets. For all graphs, we learn influence probabilities on edges 
using the method proposed in [12], which is widely adopted in prior work [8]. Links in Flixster and Last.fm 
networks are undirected, and we direct them in both directions. Links in Douban network are derived from 
follower-followee relationships and in our dataset, there is an edge from rt to u if u follows u on Douban. 

7.1 Experiments with Synthetic GAPs 


SelfInfMax 

VanillalC 

Copying 

9A|0 

0.1 

0.3 

0.5 

0.1 

0.3 

0.5 

Douban-Book 

2.16% 

1.12% 

0.71% 

133% 

419% 

676% 

Douban-Movie 

4.38% 

1.49% 

0.87% 

236% 

737% 

1283% 

Flixster 

10.6% 

0% 

0% 

134% 

352% 

641% 

Last.fm 

3.76% 

2.65% 

1.65% 

398% 

1355% 

2525% 

CompInfMax 

VanillalC 

Copying 

Ql3\$ 

0.1 

0.5 

0.8 

0.1 

0.5 

0.8 

Douban-Book 

548% 

644% 

717% 

834% 

238% 

252% 

Douban-Movie 

1060% 

994% 

713% 

637% 

106% 

89.2% 

Flixster 

956% 

152% 

102% 

888% 

119% 

95% 

Last.fm 

1361% 

982% 

710% 

489% 

65.3% 

63.6% 


Table 3: Percentage improvement of CeneralTIM over VanillalC &z Copying, where the other seed 
set is randomly chosen 

We first evaluate our proposed algorithms using synthetic GAPs. We compare with two intuitive baselines: 
(i) VanillalC: It selects k seeds using TIM algorithm [24] under the classic IC model, essentially ignoring the 
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SelfInfMax 

VanillalC 

Copying 

9A|0 

0.1 

0.3 

0.5 

0.1 

0.3 

0.5 

Douban-Book 

0.34% 

0.34% 

0 % 

0.34% 

0.34% 

0 % 

Douban-Movie 

0.64% 

0.54% 

0.36% 

0.64% 

0.54% 

0.36% 

Flixster 

0 % 

0 % 

0 % 

0 % 

0 % 

0 % 

Last.fm 

1.73% 

1.38% 

0.80% 

1.73% 

1.38% 

0.80% 

CompInfMax 

VanillalC 

Copying 

9B|0 

0.1 

0.5 

0.8 

0.1 

0.5 

0.8 

Douban-Book 

3.53% 

1.19% 

0.24% 

1.03% 

0.18% 

0.05% 

Douban-Movie 

141% 

22 .2% 

9.87% 

138% 

20.7% 

8.67% 

Flixster 

3.93% 

1.63% 

-1.69% 

3.93% 

1.63% 

-1.69% 

Last.fm 

5.38% 

2.19% 

0.69% 

2.72% 

0 .11% 

-0.97% 


Table 4: Percentage improvement of GeneralTIM over VanillalC &: Copying, where the other seed 
set is chosen to be the top-100 nodes by VanillalC 


other product and the NLA in Com-IC model; {ii) Copying: For SelfInfMax, it simply selects the top-A: 
B-seeds to be A-seeds and vice versa for CompInfMax. 

In SelfInfMax, we set = 0.75, 9^10 = 0-5) and (7_4|0 is set to 0.1, 0.3, 0.5, which represent 

strong, moderate, and low complementarity. In CompInfMax, we set 9^10 = 0.1, q^iB = Qb\A = 0-9> such 
that the room for B to complement A is sufficiently large to distinguish between algorithms. We vary (7 b |0 
to be 0.1, 0.5, and 0.8. 

Lots of possibilities exist for setting the opposite seed set, i.e., S-seeds for SelfInfMax and A-seeds 
for CompInfMax. We test three representative cases: (1) randomly selecting 100 nodes - this models our 
complete lack of knowledge; (2) running VanillalC and selecting the top-100 nodes ~ this models a situation 
where we assume the advertiser might use an advanced algorithm such as TIM to target highly influential 
users; (3) running VanillalC and selecting the 101st to 200th nodes - this models a situation where we assume 
those seeds are moderately influential. 

Table 2 shows the percentage improvement of our algorithms over the two baselines, for the case of 
selecting the 101st to 200th nodes of VanillalC as the fixed opposite seed set. As can be seen, CeneralTIM 
performs consistently better than both baselines, and in many cases by a large margin. 

Table 3 shows the percentage improvement of CeneralTIM over VanillalC and Copying baselines when the 
other seed set (S-seeds for SelfInfMax and A-seeds for CompInfMax) consists of 100 random nodes. As 
can be seen, GeneralTIM is significantly better except when comparing to VanillalC in SelfInfMax. This is 
not surprising, as when S-seeds are chosen randomly, they are unlikely to be very influential and hence it is 
rather safe to ignore them when selecting A-seeds, which is essentially how VanillalC operates. 

Table 4 shows the results when the other seed set is chosen to be the top-100 nodes from VanillalC. These 
nodes represent the most influential ones under the IC model that can be found efficiently in polynomial 
time^. In this case, the advantage of GeneralTIM over VanillalC and Copying is less significant. On Flixster 
dataset, the three algorithms achieve the same influence spread for SelfInfMax, while for CompInfMax, 
when gB|0 = 0.8, we even observe a slight disadvantage of GeneralTIM in certain cases. The reason is that 
when (7 b |0 = 0.8 (quite close to 1), targeting the input A-seeds to be ,B-seeds is itself a good strategy (see 
Theorem 2). For SelfInfMax, we remark that VanillalC and Copying is equivalent. 

Overall, considering Tables 2, 3, and 4 all together, we can see that in the vast majority of all test cases, 
GeneralTIM outperforms these two baselines, often by a large margin. This demonstrates that GeneralTIM 
is robust w.r.t. different selection methods of the opposite seed set. Furthermore, in real-world scenarios, 
the opposite seed sets may simply consist of “organic” early adopters, i.e., users who adopt the product 
spontaneously. The robustness of GeneralTIM is thus highly desirable as it is often difficult to foresee which 
users would actually become organic early adopters in real life. 

^Recall that influence maximization is NP-hard under the IC model, and thus the optimal top-100 most influential nodes 
are difficult to find. The seeds found by VanillalC can be regarded as a good proxy. 


22 












_-4_ B _ g^|0 _ qA\B 

Monster Inc. Shrek .88 ± .01 .92 ± .01 

Gone in 60 Seconds Armageddon .63 ± .02 .77 ± .02 

Harry Porter: Prisoner of Azkaban What a Girl Wants .85 ± .01 .84 ± .02 

Shrek The Fast and The Furious .92 ± .02 .94 ± .01 


gB|0 _ qB\A 

.92 ±.01 .96 ±.01 

.67 ± .02 .82 ± .02 

.66 ±.02 .67 ±.02 

.80 ± .02 .79 ± .02 


Table 5: Selected GAPs learned for movies from Flixster 


Also, in our model the influence probabilities on edges are assumed independent of the product; without 
this assumption Copying and VanillalC would perform even more poorly. If we additionally assume that 
the GAPs are user-dependent, VanillalC would deteriorate further. In contrast, our GeneralTIM and RR-set 
generation algorithms can be easily adapted to both these scenarios. 

7.2 Learning GAPs from Real Data 

Finding Signals from Data. For Flixster and Douban, we learn GAPs from timestamped rating data, 
which can be viewed as action logs. Each entry is a quadruple (u,i, a,t.n i a), indicating user u performed 
action a on item i at time We count a rating quadruple as one adoption action and one informing 

action: if someone rated an item, she must have been informed of it first, as we assume only adopters rate 
items. A key challenge is how to And actions that can be mapped to informing events that do not lead to 
adoptions. Fortunately, there are special ratings providing such signals in Flixster and Douban. The former 
allows users to indicate if they “want to see” a movie, or are “not interested” in one. We map both signals to 
the actions of a user being informed of a movie. The latter allows users to put items into a wish list. Thus, 
if a book/movie is in a user’s wish list, we treat it as an informing action. For Douban, we separate actions 
on books and movies to derive two datasets: Douban-Book and Douban-Movie. 

Learning Method. Gonsider two items A and B in an action log. Let Ra and I a be the set of users who 
rated A and who were informed of A, respectively. Glearly, Ra C I a. Thus, 

QAIH = \RA \ Rs^rateA] / |G \ 

where Rs^^atcA is the set of users who rated both items with B rated first, and Rs^informA is the set of users 
who rated B before being informed of A. Next, qA\B is computed as follows: 

QA\B = iRB^r-ateAl / I^ ^ | • 

Similarly, (7g|0 and qB\A can be computed in a symmetric way. 

Tables 5-7 depict selected GAPs learned from Flixster, Douban-Book, and Douban-Movie datasets, 
using methods in §7. Here we not only show the estimated value of Q, but also give 95% confidence intervals. 
By the definition of GAPs (§3), we can treat each GAP as the parameter of a Bernoulli distribution. Gonsider 
any GAP, denoted q, and let q be its estimated value from action log data. The 95% confidence interval^ of 
q is given by 

q - 1.96^ q{l - q)/nq, q + 1.96^ q{l - q)/nq , 
where Uq is the number of samples used for estimating q. 

7.3 Experiments with Learned GAPs 

Baselines. We compare GeneralTIM with several baselines commonly used in the literature: HighDegree: 
choose the k highest out-degree nodes as seeds; PageRank: choose the k nodes with highest PageRank score; 
Random: choose k seeds uniformly at random. We also include the Greedy algorithm [15] with lOK iterations 

®See any standard textbooks on probability theory and statistics 
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_ ^ _ g __ QA\& 

The Unbearable Lightness of Being Norwegian Wood (Japanese) .75 ± .01 

Harry Potter and the Philosopher’s Stone Harry Potter and the Half-Blood Prince .99 ib 1 
Stories of Ming Dynasty III (Chinese) Stories of Ming Dynasty VI (Chinese) .94 ± .01 
Fortress Besieged (Chinese) Love Letter (Japanese) .89 ± .01 


QAIB _ jB\<i<1B\A 


.85 ± .02 
1.0 ± 0 
1.0 ± 0 
.91 ± .03 


.92 ± .01 
.97 ± .01 
.88 ± .03 
.82 ± .02 


.97 ± .01 
.98 ± .01 
.98 ± .01 
.83 ± .03 


Table 6: Selected GAPs learned for movies from Douban-Book 


A 

B 

QAlti 

<1A\B 

Qb\$ 

Qb\a 

Up 

3 Idiots 

.92± .01 

.94± .01 

.92± .01 

.93± .01 

Pulp Fiction 

Leon 

.81 ±.01 

.83± .01 

.95 ± .00 

.98± .01 

The Silence of the Lambs 

Inception 

.90± .01 

.86± .01 

.92± .01 

.98± .01 

Fight Club 

Se7en 

.84± .01 

.89± .01 

.89± .01 

.95± .01 


Table 7: Selected GAPs learned for movies from Douban-Movie 


of MC simulations to compute influence spread for the Com-IC diffusion processes. VanillalC and Copying 
are omitted as the results are similar to those in §7.1 (when the GAPs are close to each other). 
Parameters. The following pairs of items are tested: 

• Douban-Book: The Unbearable Lightness of Being as A and Norwegian Wood as B, and Q = 
{0.75,0.85,0.92,0.97}. 

• Douban-Movie: Fight Club as A and Se7en as B, and Q = {0.84,0.89,0.89,0.95}. 

• Flixster: Monster Inc as A and Shrek as B. 

• Last.fm: There is no signal in the data to indicate informing events, so the learning method in §7.2 is 
not applicable. As a result, we use synthetic Q = {0.5,0.75,0.5,0.75}. 

In all four datasets, A and B are mutually complementary, for which self/cross-submodularity does not 
hold (§5). Hence, Sandwich Approximation (SA) are used by default for GeneralTIM and Greedy [15]. Unless 
otherwise stated, k = 50. For GeneralTIM, f = 1 so that a success probability of 1 — 1/|U| is ensured [24]. In 
SelfInfMax (resp. CompInfMax), the input ;B-seeds (resp. A-seeds) are chosen to be the 101st to 200th 
seeds selected by VanillalG. We set e = 0.5, which is chosen to achieve a balance between efficiency (running 
time) and effectiveness (seed set quality). In what follows we empirically validate that influence spread is 
almost completely unaffected when e varies from 0.1 to 1. 

Algorithms are implemented in CH—h and compiled using gH—h 03 optimization. We run experiments on 
an openSUSE Linux server with 2.93GHz GPUs and 128GB RAM. 

Effect of e in GeneralTIM 

As mentioned in §6, e controls the trade-off between approximation ratio and efficiency. Figure 4 plots 
influence spread and running time (log-scale) side-by-side, as a function of e, on Flixster and Douban-Book 
for both problems. The results on other datasets are very similar and thus omitted. We can see that as e 
goes up from 0.1 to 0.5 and 1 (in fact, e = 1 > (1 — 1/e) means theoretical approximation guarantees are 
lost), the running time of all versions of GeneralTIM (RR-SIM, RR-SIM+, RR-GIM) decreases dramatically, 
by orders of magnitude, while, in practice, influence spread (SelfInfMax) and boost (CompInfMax) are 
almost completely unaffected (the largest difference among all test cases is only 0.45%). 

Quality of Seeds 

The quality of seeds is measured by the influence spread or boost achieved. We evaluate the spread of seed 
sets computed by all algorithms by MG simulations with lOK iterations for fair comparison. As can be seen 
from Figures 5 and 6, our RR-set algorithms are consistently the best in almost all test cases, often leading 
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Figure 4: Effects of e on RR-set algorithms (Flixster and Donban-Book) 
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increase in A-Spread (x 100) increase in A-Spread (x 100) 



















(c) Flixster (d) Last.fm 

Figure 5: y^-Spread vs. 15^1 for SelfInfMax (RR— GeneralTIM with RR-SIM+, Page — PageRank; 
Deg — High-Degree, Rand — Random) 


by a significant margin. The results of Greedy are omitted, since the spread it achieves is almost identical 
to GeneralTIM, matching the observations in prior work [24]. RR-SIM results are identical to RR-SIM+, and 
thus also omitted. 

For SelfInfMax, GeneralTIM with RR-SIM+ is 13%, 2.7%, 100%, and 13% better than the next best 
algorithm on Douban-Book, Douban-Movie, Flixster and Last.fm respectively, while for CompInfMax, 
GeneralTIM with RR-CIM is 31%, 31%, 122%, and 51% better. The boost in .A-spread provided by B- 
seeds (GeneralTIM with RR-GIM) is at least 6% to 15% of the original Al-spread by only. HighDegree 
performs well, especially in graphs with many nodes having large out-degrees (Douban-Movie, Last.fm), while 
PageRank produces good quality seeds only on Last.fm. Random is consistently the worst. The performances 
of baselines are generally consistent with observations in prior works [9, 10,24] albeit for different diffusion 
models. 

Running Time and Scalability 

We compare the running time of GeneralTIM to Greedy, shown in Figure 7(a). As can be seen, for SelfInf¬ 
Max, GeneralTIM with RR-SIM, RR-SIM+ is about two to three orders of magnitude faster than Greedy; for 
CompInfMax, GeneralTIM with RR-CIM is also about two orders of magnitude faster than Greedy. In addi¬ 
tion, we observe that RR-SIM+ is 12, 8, 7, and 2 times as fast as RR-SIM on Douban-Book, Douban-Movie, 
Flixster, and Last.fm respectively. The running time of HighDegree, PageRank, and Random baselines are 
omitted since they are typically very efficient [8-10]. 

We then use larger synthetic networks to test the scalability of GeneralTIM with our RR-set generation 
algorithms. We generate power-law random graphs of 0.2, 0.4, ..., up to 1 million nodes with a power-law 
degree exponent of 2.16 [9]. These graphs have an average degree of about 5. We use the GPAs from Flixster. 
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(a) Douban-Book, cr^( 5 '_ 4 , 0 ) = 612 (b) Douban-Movie, cryt(>S'_ 4 , 0) = 2357 




Number of B-Seeds 


(c) Flixster, cr_ 4 (S'^, 0 ) = 1643 


(d) Last.fm, cr^(S'_ 4 , 0 ) = 1568 


Figure 6: Boost in yl-Spread vs. |5'b| for CompInfMax (RR — GeneralTIM with RR-CIM) 


We can see that GeneralTIM with RR-SIM+ within 6.2 hours for the 1-million node graph, and its running time 
grows linearly in graph size, which indicates great scalability. RR-CIM is slower due to the inherent intricacy 
of CompInfMax, but it also scales linearly. To put its running time measures in perspective. Greedy — the 
only other known approximation algorithm for CompInfMax — takes about 48 hours on Flixster (12. 9K 
nodes), while GeneralTIM with RR-CIM is 4 hours faster on a graph 10 times as large. 

Approximation Factors by Sandwich Approximation 

Recall from §6.4 that the approximation factor yielded by SA is data-dependent: cr(Ssand) > 
max{ } ■ (1 ~ 1/e — e) • o'iS*). To see how good the SA approximation factor is in real- 

world graphs, we compute a{S^)/v{S^), as SA is guaranteed to have an approximation factor of at least 
(1 - 1 /e - e) • a{S^)/v[Su). 

In the GAPs learned from data, both ( 7 b |_4 — gB |0 and — qA\$ ^re small and thus likely “friendly” 
to SA, as we mentioned in §6.4. Thus, we further “stress test” SA with more adversarial settings: First, set 
<7^10 = 0-3 and qA\B = 0.8; Then, for SelfInfMax, fix q^^A = 1 and vary ( 7 b |0 from {0.1, 0.5,0.9}; for 
CompInfMax, fix <7010 = O.I and vary qB\A from (0.1, 0.5,0.9}. 

Table 8 illustrates the results on all datasets with both learned GAPs and artificial GAPs. We use 
shorthands SIM and CIM for SelfInfMax and CompInfMax respectively. Subscript learn means the GAPs 
are learned from data. In stress-test cases (other six rows), e.g., for SIM, subscript 0.5 means <7010 = .5, while 
for CIM, it means qB\A = -S- As can be seen, with real GAPs, the ratio is extremely close to 1, matching 
our intuition. For artificial GAPs, the ratio is not as high, but most of them are still close to 1. E.g., in 
the case of SIM 0 5 , a{Sv)/v{S^) ranges from 0.628 (Last.fm) to 0.969 (Douban-Movie), which correspond 
to an approximation factor of 0.40 and 0.61 (e omitted). Even the smallest ratio (0.492 in SIMq.i, Flixster) 
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Figure 7: Running time 



Douban-Book 

Douban-Movie 

Flixster 

Last.fm 

SIMiearn 

0.996 

0.999 

0.996 

0.999 

SIMo.l 

0.652 

0.962 

0.492 

0.519 

SIMo.5 

0.770 

0.969 

0.633 

0.628 

SIMo.9 

0.946 

0.985 

0.926 

0.879 

CIMiearn 

0.973 

0.918 

0.950 

0.825 

CIMo.l 

0.913 

0.832 

0.933 

0.772 

CIMo.5 

0.936 

0.885 

0.969 

0.857 

CIMo.9 

0.956 

0.976 

0.993 

0.959 


Table 8: Sandwich approximation: a{Su)/v{Sv) 


would still yield a decent factor at about 0.3. This shows that SA is fairly effective and robust for solving 
non-submodular cases of SelfInfMax and CompInfMax. 

We also measure the effectiveness of SA by comparing the spread achieved by seed sets Sa, 3,^, 5”^ obtained 
w.r.t. the original, upper bound, and lower bound functions respectively. Such spread must be computed 
using the original function a, and in our case, the unaltered GAPs. More specifically, we calculate the relative 
error defined as follows (for CompInfMax, disregard S'^). 

QA - max{|a(5'^) - a(5'^)|, |g(5'^) - g(5'^)|} 


In all four datasets, for both problems, the largest error is only 0.2%! To see if this is due to that and 
Qb\A being close in the GAPs learned from action log data, we further “stress test” SA with a much more 
adversarial set-up. For SelfInfMax, vary ( 7 e |0 to be {.1, .5, .9} and fix = .96. For CompInfMax, vary 
Qi3\A fo t )6 {•!, .5, .9} and fix gg |0 = .1 (to maintain complementarity). No change is made to g_ 4|0 and 

Figure 8 compares a{Sa-), (t(S'^), and a{Si,) on Flixster. As can be seen, even in this adversarial setting, 
SA is still highly effective for both SelfInfMax and CompInfMax. Amongst all test cases, the largest 
error is 0.4%. The results on other datasets are very similar and hence omitted. 


8 Conclusions & Future Work 

In this work, we propose the Comparative Independent Cascade (Com-IC) model that allows any degree 
of competition or complementarity between two different propagating items, and study the novel SelfInf¬ 
Max and CompInfMax problems for complementary products. We develop non-trivial extensions to the 
RR-set techniques to achieve approximation algorithms. For non-submodular settings, we propose Sand¬ 
wich Approximation to achieve data-dependent approximation factors. Our experiments demonstrate the 
effectiveness and efficiency of proposed solutions. 


28 


































2000 



0.1 0.5 0.9 0.1 0.5 0.9 


SelfInfMax CompInfMax 

SIM(LB) CIM(UB) 

SIM SIM(UB) CIM 


Figure 8: Sandwich Approximation on Flixster 


For future work, one direction is to design more efficient algorithms or heuristics for SelfInfMax and 
(especially) CompInfMax; e.g., whether near-linear time algorithm is still available for these problems 
is still open. Another direction is to fully characterize the entire GAP space Q in terms of monotonicity 
and submodularity properties. Moreover, an important direction is to extend the model to multiple items. 
Given the current framework, Gom-IG can be extended to accommodate k items, if we allow k ■ 2^“^ GAP 
parameters — for each item, we specify the probability of adoption for every combination of other items 
that have been adopted. However, how to simplify the model and make it tractable, how to reason about 
the complicated two-way or multi-way competition and complementarity, how to analyze monotonicity and 
submodularity, and how to learn GAP parameters from real-world data all remain as interesting challenges. 
Last, it is also interesting to consider an extended Gom-IG model in which influence probabilities on edges 
are product-dependent. 
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Appendix 


A Remarks on Com-IC Model 

A.l Unreachable States 

Recall that in the Com-IC model, before an influence diffusion starts, all nodes are in the initial joint state 
(A-idle, ,B-idle). According to the diffusion dynamics defined in Figure 2, there exist five unreachable joint 
states, which are not material to our analysis and problem-solving, since none of these is relevant to actual 
adoptions, the objectives studied in SelfInfMax and CompInfMax. For completeness, we list these states 
here. 

1 . (A-idle, ;B-rejected) 

2 . (A-suspended, ;B-rejected) 

3. (A-rejected, ;B-idle) 

4. (A-rejected, ;B-suspended) 

5. (A-rejected, ;B-rejected) 

Lemma 9. In any instance of the Com-IC model (no restriction on CAPs), no node can reach the state of 
(A-idle, B-rejected), from its initial state of (A-idle, B-idle). 

Proof. Let v be an arbitrary node from graph G = (V, E,p). Note that for v to reject B, it must be first be 
informed of B (otherwise it remains ;B-idle, regardless of its state w.r.t. A), and then becomes ,B-suspended 
(otherwise it will be ;B-adopted, a contradiction). Now, note that v is never informed of A, and hence it will 
not be triggered to reconsider B, the only route to the state of ,B-rejected, according to the model definition. 
Thus, (A-idle, S-rejected) is unreachable. □ 

The argument for (A-rejected, yB-idle) being unreachable is symmetric, and hence omitted. 

Lemma 10. In any instanee of the Com-IC model (no restriction on CAPs), no node can reach the state 
of (A-suspended, B-rejected), from its initial state of (A-idle, B-idle). 

Proof. Let v be an arbitrary node from graph G = (V, E,p). Note that for v to reject B, it must be first be 
informed of B (otherwise it remains ;B-idle, regardless of its state w.r.t. A), and then becomes S-suspended 
(otherwise it will be ,B-adopted, a contradiction). Now, v transits from A-idle to A-suspended, meaning that 
V does not adopt A. This will not further trigger reconsideration, and hence v stays at S-suspended. This 
completes the proof. □ 

The argument for (A-rejected, A-suspended) being unreachable is symmetric, and hence omitted. Finally, 
it is evident from the proof of Lemma 10 that, the joint state of (A-suspended, A-suspended) is a sunken 
state, meaning the node will not get out it to adopt or reject any product. This implies that (A-rejected, 
;B-rejected) is also unreachable. 

A.2 Counter-Examples for Submodularity and Monotonicity 

The first two counter-examples show that self-monotonicity and cross-monotonicity may not hold in general 
for the Com-IC model when there is no restriction on GAPs. 

Example 1 (Non-Self-Monotonicity). Consider Figure 9. All edges have probability 1. GAPs are q ^|0 = 
q € (0,1), ( 7 _ 4 |b = ( 7 b |0 = 1, gB |_4 = 0, which means that A competes with B but B complements A. Let 
Sb = {y}. If Sjx is S' = {si}, the probability that v becomes A-adopted is 1 , because v is informed of A from 
si, and even if it does not adopt A at the time, later it will surely adopt B propagated from y, and then v 
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Figure 9: The graph for Example 1 


will reconsider A and adopt A. If it is T = {si, S 2 }, that probability is 1 — g + < 1 : w gets ^-adopted 
w.p. q blocking B and then v gets ^-adopted w.p. q; w gets yB-adopted w.p. (1 — q) and then v surely gets 
^-adopted. Replicating sufficiently many v’s, all connected to si and w, will lead to <ja{T^ Sq) < Sb)- 

The intuition is that the additional ^-seed S 2 “blocks” B-propagation as A competes with B {qB\A < 9810 ) 
but B complements A {qA\B > 9 ^| 0 )- Clearly a a is not monotonically decreasing in Sa either (e.g., in a 
graph when all nodes are isolated). Hence, a a is not monotone in Sa- n 



Figure 10: The graph for Example 2 


Example 2 (Non-Cross-Monotonicity). We use the example shown in Figure 10. All edges have probability 
1 . Nodes yi and 1/2 are the two fixed A seeds, and we want to grow B seed set from S = {si} to T = {si, S 2 }. 
GAPs satisfy 0 < qA\B < qA\<6 < 1 and 0 < gB |0 < q^^A < !> which means that A complements B but B 
competes with A. Since B competes with A, it is straightforward to have examples in which the tr^ decreases 
when B seed set grows. We use Figure 10 to show a possible world in which the growth of B seed sets leads 
to V adopting A, indicating that A spread may also increase. Even though we do not have a direct example 
showing that a a increases when B seed set grows, we believe the possible world example is a good indication 
that a A is not cross-monotone in Sb- 

Figure 10 uses Figure 9 as a gadget. Intuitively, when B seed set grows from S to T, the probability that 
2 : adopts B decreases as shown in Example 1. Then we utilize this and the fact that B competes with A to 
show that when S is the B seed set, due to B’s competition from x node v will not adopt A, but when T is 
the B seed set, there is no longer B’s competition from x and thus v will adopt A. 

The node thresholds in the possible world are as follows: qA\B < c^a — 9^|0i 98|0 < 0 :% — 9b\Aj <1a\9 < o^a^ 
Qa\B < ctyi ^ 9.4|0) all other non-specified a values take value 0, meaning that they will not block diffusion. 
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Consider first that S is the B seed set. Since ( 7 b |0 < Og, a; is informed about B from si but will not adopt 
B directly from si. From yi, we can see that A will pass through u, w and reaches x. After x adopts A, 
it reconsiders B, and since ag < x adopts B. Node x then informs z about A and B, in this order. 
However, since ( 7 _ 4|0 < z does not adopt A, but it adopts B (ag = 0 < < 7010 ). Next z informs v about 
B, which adopts B since Og = 0 < ( 7 b| 0 . This happens one step earlier than A reaches v from 7 / 2 , but since 
( 7 _ 4 |B < a^, V will not adopt A. 

Now consider that T is the B seed set. In this case, w definitely adopts B from S 2 - Since qA\B < 
and w adopts B first, w will not adopt A from yi. Because qB |0 < Og and w blocks A from reaching x, x 
will not adopt A or B. Then z will not adopt A ov B either. This allows A to reach v from y 2 , and since 
Qaib < 9 . 410 . V adopts A. 

We can certainly duplicate v enough times so that when B seed set grows from S to T, the A-spread in 
this possible world also increases, even though B is competing with A. □ 



Figure 11: The graph for Example 3 and 4 

The following two examples show that even when two items are mutually complementary, self¬ 
submodularity and cross-submodularity in general may not hold. 

Example 3 (Non-Self-Submodularity). Consider the possible world in Figure 11. All edges are live. The node 
thresholds are: for w: a'^ < 9 ^ 10 , q^iih < ag < qB\A', for z: > q^is, Oi% < 9b| 0; for v: 9^10 < a\ < 9 ^|g, 

Og < gg| 0 ; Then fix 5'g = {y}. For Sa, let S' = 0, T = {a;}, and u is the additional seed. It can be verified 
that only when Sa = T U {u}, v becomes A-adopted, violating self-submodularity. 

A concrete example of Q for which submodularity does not hold is as follows: qA\$ = 0.078432; qA\B = 
0.24392; ( 7 g |0 = 0.37556; qj 3 \A = 0.99545. Seed sets are the same as above. We denote by Pv{Sa) the 
probability that v becomes A-adopted with A-seed set Sa- It can be verified that: Pv{S) — 0, Pv{S U {u}) = 
8.898•10-^p„(^) = 0.027254, andp„(TU{M}) = 0.027383. Clearly, p„(ru{u})-p„(r) > p„(SU{u})-p„(S). 
Hence, replicating v sufficiently many times will lead to aA(T U {u}, Sg) — <ja{T, Sg) > (t^(S U {u}, Sg) — 
cr_ 4 (S, Sg), violating self-submodularity. □ 

Example 4 (Non-Cross-Submodularity). Consider the possible world in Figure 11. The node thresholds 
are: For w. 9^10 < a'X < q^|g, < ( 7 g| 0 ; for z: < 9 ^ 10 , > gg|^; for v: q ^|0 < < g^|g, < qsie- 

Fix Sa = {y}- For S'g, let S' = 0, T = {x}, and u is the additional seed. It can be verified that only when 
Sg = T U {it}, V becomes A-adopted, violating cross-submodularity. □ 

The above example applies even when qB\A = 9 b|0 < 1- The key is that qB\A < 1 ag > qB\A^ which 

prevents B to pass through z, and thus another ;B-seed u is needed to inform v of B. 

B Proofs and Additional Theoretical Results 

B.l Proofs for Results in Section 4 

Theorem 2 (re-stated). For CompInfMax, when ( 7 g |0 = 1 and k > |S_ 4 |, we can solve the problem 
optimally by setting Sg to be Sa U X, where X is an arbitrary set in V \ Sa with size k — |S^|, that is 
(7a{Sa,Sa U A) = uiaxTcv,\T\=k<^AiSA,T). 
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Proof. We first show the theorem holds in an arbitrary possible world W. Following Eq. (4), when 5'^ acts 
alone (without any ,B-seeds), we can classify all nodes into four types: y4-adopted, y4-rejected, .4-suspended, 
and y4-potential. Clearly, only y4-suspended and ^-potential may later adopt A with the help of B, and the 
other two types are not relevant to CompInfMax. 

Now, let all nodes in be S-seeds, and thus they become S-adopted, too. Since ( 7 b |0 = 1, all nodes that 
can be reached by the seeds via live-edge paths will eventually become S-adopted, including all ^-suspended 
and ^-potential nodes. For ^-suspended nodes, after adopting B, they will adopt A through re-consideration 
right away. For ^-potential nodes, the propagation of A will reach them after .4-suspended nodes upstream 
adopt A (together with B), and after being reached, they adopt both items. 

Therefore, in this possible world, making Sy[ to be ;B-seeds will “convert” all nodes that can possibly 
become y4-adopted, namely all .4-suspended and y4-potential nodes. Hence, any additional H-seed will not 
further increase the spread of A as no other nodes can possibly be “converted”. 

Since the claim in this theorem holds true for any arbitrary possible world W, it also holds for all other 
possible worlds that belong to the same equivalence class W. Since 

w 

the theorem is proven. □ 

B.2 Proofs for Results in Section 5 

Lemma 1 (re-stated). For any fixed A-seed set and B-seed set Sq, the joint distributions of the sets of 
A-adopted nodes and B-adopted nodes obtained (i) by running a Com-IC diffusion from and Sq and (ii) 
by randomly sampling a possible world W and running a deterministic cascade from and S'g in W, are 
the same. 

Proof. The proof is based on establishing equivalence on all edge-level and node-level behaviors in Com-IC 
and the PW model. By the principle of deferred decisions and the fact that each edge is only tested once in 
one diffusion, edge transition processes are equivalent. To generate a possible world, the live/blocked status 
of an edge is pre-determined and revealed when needed, while in a Com-IC process, the status is determined 
on-the-ffy. 

Tie-breaking is also equivalent. Note that each node v only needs to apply the random permutation 7r„ 
for breaking ties at most once. For Com-IC, we need to apply 7r„ only when v is transitioning out of state 
(.4-idle, H-idle) after being informed of both A and B. Clearly, this transition occurs at most once for each 
node. The same logic applies to the PW model. Thus, the equivalence is obvious due to the principle of 
deferred decisions. 

The equivalence of decision-making for adoption is straightforward as and are chosen uniformly 
at random from [0,1]. Hence, Pr[Q !)4 < q] = q, where q S {qA\<6,qA\B, 9 b| 0 , 'ZbIaI- 

As to reconsideration, w.l.o.g. we consider A. In Com-IC, the probability of reconsideration is pA = 
max{(( 7 _ 4 |B — qA\iii), 0}/(l — 9 a| 0 )- In PW, when qA\B > <?A| 0 ) this amounts to the probability that < qA\B 
given > qA\$, which is (q^je - qA\$)/ (1 - qA\$)- On the other hand, when g_4|B < g^|0, a\ > qA\$ implies 
c^A ^ 9a\Bi which means reconsideration is meaningless, and this corresponds to pA = 0 in Com-IC. Thus, 
the equivalence is established. 

Finally, the seeding protocol is trivially the same. Combining the equivalence for all edge-level and node¬ 
level activities, we can see that the two models are equivalent and yield the same distribution of A- and 
H-adopted nodes, for any given Sa and Sb- □ 

Theorem 3 (re-stated). For any fixed B-seed set Sb, cr^(S'Ai Sb) is monotonically increasing in Sa for any 
set of GAPs in Q+ and Q~. Also, cr_ 4 (S'_ 4 , Sb) is monotonically increasing in Sb for any GAPs in Q'*’, and 
monotonically decreasing in Sb for any Q“. 

For ease of exposition, we also state a symmetric version of Theorem 3 w.r.t. ctb. That is, given any 
fixed A-seed set, crB{SA, Sb) is monotonically increasing in Sb for any set of GAPs in and Q^. Also, 
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(Tb(iS'^, Sis) is monotonically increasing in for any GAPs in Q^, and monotonically decreasing in Sj\, for 
any Q“. For technical reasons and notational convenience, in the proof of Theorem 3 presented below, we 
“concurrently” prove both Theorem 3 and this symmetric version, without loss of generality. 

Proof of Theorem 3. We first fix a S-seed set Sb- Since Sjs is always fixed, in the remaining proof we ignore 
Sjs from the notations whenever it is clear from context. It suffices to show that monotonicity holds in 
an arbitrary, fixed possible world, which implies monotonicity holds for the diffusion model. Let W be an 
arbitrary possible world generated according to §5.1. 

Define ( 5 '_ 4 ) (resp. to be the set of ^-adopted (resp. ;B-adopted) nodes in possible world 

W with S _4 being the ^-seed set (and S'g being the fixed ,B-seed set). Furthermore, for any time step t > 0, 
define (resp. <i>^ {Sj\,t)) to be the set of ^-adopted (resp. ,B-adopted) nodes in W by the end of 

step t, given ^-seed set Clearly, <h^(S'^) = Ut>o$^(5'^, t) and (S'_ 4 ) = (SaG)- Let S and 

T be two sets, with S C T C V. 

Mutual Competition Q~. Our goal is to prove that for any v £V, (a) if u G <h)^(S'), then v € <I>)^(T); 
and (b) if u S (T), then v G (S). Item (a) implies self-monotonic increasing property while item (b) 
implies cross-monotonic decreasing property. We use an inductive proof to combine the proof of above two 

results together, as follows. For every t > 0, we inductively show that (i) if v G {S, t), then v G $5^ (T, t); 

and (a) if V G (T, t), then v G (S', t). 

Consider the base case of t = 0. If u S $^4 (S, 0), then it means v G S, and thus v G T = (T, 0). If 

V G (T, 0), it means v G Sg, and thus v G (S, 0) = Sg. 

For the induction step, suppose that for all t < t, (i) and (m) hold, and we show (i) and (ii) also hold 
for t = t'. For (*), we only need to consider v G $5"(S, t') \ (S, t' — 1), i.e. v adopts A at step t' when S 

is the ^-seed set. Since v adopts A, we know that < qA\$- Let U be the set of in-neighbors of v in the 
possible world W. Let Ua(Sa) = Un<^J(SA,t' - 1) and Ub(Sa) = f/C $^(S^,t' - 1), i.e. Ua(Sa) (resp. 
Ub{Sa)) is the set of in-neighbors of u in IF that adopted A (resp. B) by time — 1, when Sa is the ^-seed 
set. Since v G t'), we know that Ua{S) 0. By induction hypothesis, we have Ua{S) C Ua{T) and 

UeiT) C Ub{S). 

Thus, Ua{T) 7 ^ 0, which implies that by step t', v must have been informed of A when T is the ^-seed 
set. If < 9^1 g, then no matter if v adopted B or not, v would adopt A by step t' according to the possible 
world model. That is, v G $)^(T, F). 

Now suppose g_ 4 |g < qA\(6- For a contradiction suppose v ^ i-c., v does not adopt A by 

step t' when T is the yl-seed set. Since v has been informed of A by t', the only possibility that v does not 
adopt A is because v adopted B earlier than A, which means v G Two cases arise: 

First, if w G <1>^ (T, t' — 1), then by the induction hypothesis v G $^(S', — 1). Since v G $^4 (5, F) \ 

<i>^(S', F — 1), it means that when S is the yl-seed set, v adopts B first before adopting A, but this contradicts 
to the condition that g_ 4 |g < Therefore, v ^ <i>^(T, F — 1). 

Second, v G (T, t') \ (T, t' — 1). Since v ^ (T, t'), it means that v is informed of A at step t' 

when T is the ^-seed set, and thus the tie-breaking rule must have been applied at this step and B is ordered 
first before A. However, looking at the in-neighbors of v in IF, by the induction hypothesis, Ua{S) C Ua{T) 
and Ub{T) C Ub{S). This implies that when S is the ^-seed set, the same tie-breaking rule at A would still 
order B first before A, but this would result in v not adopting A at step t', a contradiction. Therefore, we 
know that v G (T, t'). 

The statement of (ii) is symmetric to (i): if we exchange A and B and exchange S and T, (ii) becomes 

(i) . In fact, one can check that we can literally translate the induction step proof for (i) into the proof for 

(ii) by exchanging pair A and B and pair S and T (except that (a) we keep the definitions of Ua(Sa) and 
Ub(Sa), and (b) whenever we say some set is the ^-seed set, we keep this A). This concludes the proof of 
the mutual competition case. 

Mutual Complementarity Q+. The proof structure is very similar to that of the mutual competition 
case. Our goal is to prove that for any v G V, (a) if u G $^4 (S), then v G $^4 (T); and (b) if u G (S), 
then V G (T). To show this, we inductively prove the following: For every t > 0, (i) if v G (S, t), then 

V G <h )4 (T, t); and (ii) if u G (S, t), then v G (T, t). The base case is trivially true. 
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For the induction step, suppose (i) and {ii) hold for all t < t', and we show that (i) and (ii) also hold 
for t = t'. 

For (i), we only need to consider v G (S', t') \ $5 "(^i ~ !)> i-®- adopts A at step t' when S is the 

^-seed set. Since v adopts A, we know that Since v € we know that ^ 0. By 

induction hypothesis we have U_a(S) C Ua(T). Thus we know that Uj\{T) ^ 0, which implies that by step 
i!, V must have been informed of A when T is the ^-seed set. if < (Z_ 4 | 0 , then no matter v adopted B or 
not, V would adopt A by step t' according to the possible world model. Thus, v G 

Now suppose g_ 4|0 < < qA\B- Since v G $^4 (S, t'), the only possibility is that v adopts B first by time 

t' so that after reconsideration, v adopts A due to condition < qaib- Thus we have v G {S,t'), and 

< 9810 . 

If u S (S,t' — 1), by induction hypothesis v G (T,t' — 1), which means that v adopts B by time 
t' — 1 when T is the .4-seed set. Since v has been informed of A by time t' when T is the .4-seed set, condition 

< qA\B implies that v adopts A by time t' when T is the .4-seed set, i.e. v G (T, t’). 

Finally we consider the case of u G (S,t') \ (S,t' — 1). Looking at the in-neighbors of v in W, 

V G $^(5,t'), implies that Ub{S) ^ 0. By the induction hypothesis, we have C/b(S') C U] 3 (T), and thus 
Ub(T) 7 ^ 0. This implies that when T is the .4-seed set, node v must have been informed of B by time F. 
Since Og < 9010 , we have that v adopts B by time t' when T is the .4-seed set. Then the condition a\ < qA\B 
implies that v adopts A by time t' when T is the .4-seed set, i.e. v G <P^(T,t'). 

This concludes the inductive step for item (i) in the mutual complementarity case. The induction step 
for item (ii) is completely symmetric to the inductive step for item (i). Therefore, we complete the proof for 
the mutual complementarity case. As a result, the whole theorem holds. □ 

Lemma 2 (re-stated). Consider any Com-IC instance with Q+. Given fixed A- and B-seed sets, for all 

nodes v G V, all permutations of v’s in-neighbors are equivalent in determining if v becomes A-adopted and 
B-adopted, and thus the tie-breaking rule is not needed for mutual complementary case. 

Proof. Without loss of generality, we only need to consider a node v and two of its in-neighbours and ub 
which become A-adopted and ;B-adopted at f — 1 respectively. In a possible world, there are nine possible 
combinations of the values of and We show that in all such combinations, the ordering tti = (u^, ug) 
and 712 = (wb, Uj\} produce the same outcome for v. 

1. < g_ 4|0 A ttg < qB\$. Both tti and 712 make v A-adopted and S-adopted. 

2. < ( 7 _ 4|0 a ( 7 g |0 < cig < qB\A- Both tti and 7 r 2 make v A-adopted and S-adopted. With 7 r 2 , v first 

becomes S-suspended, then A-adopted, and finally S-adopted due to re-consideration. 

3. < g_ 4|0 A ag > qB\A- Both tti and 7 r 2 makes v A-adopted only. 

4- (Z. 4|0 < < qA\B 4 Q!g < gg| 0 . Symmetric to case 2 above. 

5- qA\$ < cij, — ^A\B S (Zg |0 < Qfg < qB\A- III this case, v does not adopt any item. 

6- 9.410 < cAj, — ^A\B A ttg > qB\A- In this case, v does not adopt any item. 

7. > qA\B A Qfg < qg| 0 . Symmetric to case 3 above v is S-adopted only. 

8 . > g_ 4 |g A gg |0 < ag < qB\A- Symmetric to case 6 above. 

9. > qA\B A ag > qB\A- In this case, v does not adopt any item. 

Since the possible world model is equivalent to Com-IC (Theorem 1 ), the lemma holds as a result. □ 

Lemma 3 (re-stated). In the Com-IC model, if B is indifferent to A (i.e., qB\A = qB\<6)> then for any 
fixed B seed set Sb, the probability distribution over sets of B-adopted nodes is independent of A-seed set. 
Symmetrically, the probability distribution over sets of A-adopted nodes is also independent of B-seed set if 
A is indifferent to B. 
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Proof. Consider an arbitrary possible world W. Let q := A node v becomes ;B-adopted in W as 

long as Qfg < g and there is a live-edge path Pg from Sb to v such that for all nodes w on Pg (excluding 
seeds), ag < q. Since (?b |0 = qB\Ai this condition under which v becomes P-adopted in W is completely 
independent of any node’s state w.r.t. A. Thus, the propagation of P-adoption is completely independent of 
the actual A-seed set (even empty). Due to the equivalence of the possible world model and Com-IC, the 
lemma holds. □ 

Claim 3 (re-stated). On any A-path Pj\^, if some node w adopts B and all nodes before w on P _4 are A-ready, 
then every node following w on P _4 adopts both A and B, regardless of the actual B-seed set. 

Proof of Claim 3. Since P _4 is an A-path (where all nodes are A-adopted), every node w on P _4 (except the 
starting node) has < qA\B- For every such node w, if g_ 4|0 < a'^ < then w adopting A implies that 
w must have adopted B first and ag < < 7010 . 

Now suppose a node w on the path adopts B (under some P-seed set), and all nodes before w on path 
Pa are A-ready. Then all nodes before w on this path adopts A regardless of P-seed set. Thus w is informed 
of A. Since < q^|B, w adopts A. 

Then consider the node w' after w on the path P_ 4 . Node w' must be informed by both A and B since w 
adopts both A and B and the edge {w,w') is live. If < < 7 ^ 10 , w' will adopt A, and then since qB\A = 1j 
w' will then adopt B - this is where we use the key assumption that qB\A = 1- If QA\<ii < o:a — dA\Bj then 
we have argued that in this case ag < (7 b|0, so w' would adopt B, followed by adopting A. We can then 
inductively use the above argument along the path to show that every node after w will adopt both A and 
B, which proves the claim. □ 

Claim 4 (re-stated). There is a B-path Pb from some B-seed ccq G TU {u} to w, such that even if xq is the 
only B-seed, w still adopts B. 

Proof of Claim 4- We construct such a path Pb backwards from w as follows. 

Since w adopts B, ii w G T U {u}, then we are done; otherwise, there must be some ;B-path Pg from 
B-seed set T U {u} to w. If Pg is a P-ready path, then we have found Pb to be Pg, and its starting node is 
a;o- This is because even if xq is the only P-seed, B can still pass through all nodes on Pb to reach w, without 
the need of any node on the path to adopt A, because by definition, all nodes u on a P-ready path satisfies 
that ag < 9810 . 

Now suppose Pg is not P-ready. It is clear that every non-P-ready node on path Pg must be A-ready 
(otherwise, it would not be possible for such a node to adopt B). If every such non-P-ready node has an 
A-ready path from S'_ 4 , then we still find Pb = Pg with its starting node is xq. This is because whenever B 
reaches a non-P-ready node on the path, the node has an A-ready path from and thus the node always 
adopt A, which means it will also adopt B (for qB\A — I)- 

Now consider the case where there exists some non-P-ready node on path Pg that does not have any 
A-ready path from Sa- Let xi be the first such node (we count backwards from w). By the definition of xi, 
we know that as long as xi adopts B (regardless what is the actual P-seed set), then xi would pass B along 
path Pg to w. Thus our backward construction has found the last piece of the path Pb as the path segment 
of Pg from xi to w, now we move the construction backward starting from xi. 

At xi, find a A-path P_(^ from Sa to xi. By the definition of Xi^, we know that P^ is not A-ready. Let 
X 2 be the first non-A-ready node on path P^ counting forward from the starting A-seed. Then X 2 must have 
adopted B. By Claim 3, we know that as long as X 2 adopts B, all nodes after X 2 on path Pf^ would adopt 
both A and B, regardless of the actual P-seed set. Applying this to xi, we know that Xi is both A- and 
P-adopted. Then our backward construction has found the next piece of Pg, which is the path segment of 
P^ from X 2 to xi, which guarantees that if X 2 adopts B, then w must eventually adopt B. 

Now if X 2 is a P-seed, we are done. If not, we will repeat the above “zig-zag constructions”: there must be 
a P-path Pg from P-seed set T U {u} to X 2 . The argument on path Pg is exactly the same as the argument 
on Pg, and if the construction still cannot stop, we will find path Pj( similar to P^, and so on. 

^Recall that it is ^-ready, on a B-path, and there is no ^-ready path from to xi 
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The construction keeps going backwards, and since the construction actually follows the strict adoption 
time line and going backward in time, in the diffusion process when TU{u} is the ;B-seed set, the construction 
must eventually reach a ,B-seed xq and stops. Then we have found the desired path Pjg and the desired starting 
node Xq. □ 

B.3 Proofs for Results in Section 6 

Lemma 4 (re-stated). Let W be a fixed possible world. Let fv,w{S) be an indicator function that takes on 

1 if S can activate v in W, and 0 otherwise. Then, fv,wi’) is monotone and submodular for all v € V if 
and only if both (PI) and (P2) are satisfied in W. 

Proof. First consider “if”. Suppose both properties hold in W. Monotonicity directly follows from Prop¬ 
erty (PI). For submodularity, suppose v can be activated by set T U {x} but not by T, where x ^ T. By 
Property (P2), there exists some u G T U {a;} such that {u} can activate v in W. If u G T, then T can 
also activate v by Property (PI), a contradiction. Hence we have u = x. Then, consider any subset S GT. 
Note that by Property (PI), S cannot activate v (otherwise so could T), while S U {a;} can. Thus, fv,w{‘) 
is submodular. 

Next we consider “only if”. Suppose fv,wi’) is monotone and submodular for every v G V. Property (PI) 
directly follows from monotonicity. For Property (P2), suppose for a contradiction that there exists a seed 
set S that can activate v in W, but there is no u G S' so that {u} activates v alone. We repeatedly remove 
elements from S until the remaining set is the minimal set that can still activate v. Let the remaining set 
be S'. Note that S' contains at least two elements. Let u G S', and then we have /„ ,w( 0 ) = fv,w{{u}) = 
fv,w{S' \ {u}) = 0 , but w{S') = 1, which violates submodularity, a contradiction. This completes the 
proof. □ 

Lemma 5 (re-stated). Let M be a stochastic diffusion model and M' be its equivalent possible world model. 
If M' satisfies Properties (PI) and (P2), then the RR-sets as defined in Definition 1 have the activation 
equivalence property as in Definition 2. 

Proof. It is sufficient to prove that in every possible world W G W, S activates v if and only if S intersects 
with u’s RR set in W, denoted by Rw{v). 

Suppose Rw{v) n S 0. Without loss of generality, we assume a node u is in the intersection. By the 
definition of RR set, set {u} can activate v in W. Per Property (PI), S can also activate v in W. 

Now suppose S activates v in W. Per Property (P2), there exists u G S such that {m} can also activate 
V in W. Then by the RR-set definition, u G Rw{v)- Therefore, S n Rw{v) 0. □ 

Theorem 7 (re-stated). Under one-way complementarity ( 9^10 < qj(\B o,nd gg |0 = the RR-sets 

generated by the RR-SIM algorithm satisfy Definition 1 for the SelfInfMax problem. As a result, Theorem 6 
applies to GeneralTIM with RR-SIM in this case. 

Proof. It suffices to show that, given a fixed possible world W, a fixed ;B-seed set Sb, and a certain node 
u G V, for any node v ^ (0, Sb) with we have: v G $^4 ({u}, Sb) if and only if there exists a 

live-edge path P from it to u such that for all nodes w G P, excluding it, w satisfies < gyi|B, and in case 
c^A > 9 a| 0 ) then w must be ;B-adopted. 

The “if” direction is straightforward as P will propagate the adoption information of A all the way to v. 
If 0)4 < q^| 0 , it adopts A without question. If 0)4 G ( 9 ^ 10 , qA\B\i then v must be ,B-adopted by the definition 
of P, which makes it ^-adopted. 

For the “only if’ part, suppose no such P exists for it. This leads to a direct contradiction since it is the 
only M-seed, and u lacks a live-edge path to v, it is impossible for v to get informed of A, let alone adopting 
A. Next, suppose there is a live-edge path P from it to v, but there is a certain node w G P which violates 
the conditions set out in the lemma. First, w could be have a “bad” threshold: > 9 _ 4 |b. In this case, w 

will not adopt A regardless of its status w.r.t. B, and hence the propagation of A will not reach v. Second, 
w could have a threshold such that a'fl G (9a|0)9a|b] but it does not adopt B under the influence of the 
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given S'b. Similar to the previous case, w will not adopt A and the propagation of A will not reach v. This 
completes the “only if” part. 

Then by Definition 1, the theorem follows. □ 

Lemma 6 In expectation, GeneralTIM with RR-SIM runs in O {{k + + \E\) log \V\ (1 + EPTp/EPT b)) 

time, (re-stated). 

Proof. Given a fixed RR-set R Q V, let uj{R) be the number of edges in G that point to nodes in R. Since 
in RR-SIM, it is possible that we do not examine incoming edges to a node added to the RR-set (c/. Cases 
l{ii) and 2{ii) in the backward BPS), we have: 

EPTb < E[a;(R)], 

where the expectation is taken over the random choices of R. By Lemma 4 in [24] (note that this lemma 
only relies on the activation equivalence property of RR-sets, which holds true in our current one-way 
complementarity setting). 


This gives 


^•EKR)]< OPTfc. 


[^■EPTb < OPTk. 

\E\ 


Following the same analysis as in [24] one can check that the lower bound LB of OPTk obtained by the 
estimation method in [24] guarantees that LB > EPTb ■ Since in our algorithm we set 9 = X/LB, 

where (following Eq.(3)) 


A = 


(8 + 2e)|R| (^nog|R|+log(^l^l) +log2^^ 


then we have that the expected running time of generating all RR-sets is: 


0{9 ■ EPT) 



■ (EPTf + EPTb)) 


O 

o 


|R| 'v EPTb) 


Oiik + mV\ + \E\)\og\V\ 



EPTf \^\ 
EPTb ))■ 


The time complexity for estimating LB and for calculating the final seed set given RR-sets are the same 
as in [24], and thus the final complexity is O [{k ^)(|R| -|- |if|) log \V\ ^1 -I- □ 

Theorem 8 (re-stated). Suppose that g_4|0 < q^\B and gB|0 < qi3\A — 1- The RR-sets generated by the 
RR-CIM algorithm satisfies Definition 1 for the CompInfMax problem. As a result, Theorem 6 applies to 
GeneralTIM with RR-GIM in this case. 


Proof. It suffices to show the following claim. 

Claim 5. Consider any fixed possible world W, a fixed A-seed set Sa, and a certain node u G V. Then, for 
any node v ^ 4>)^(S'_4,0) with we have: v G 4>)^(5'^, {u}) if and only if there exists a live-edge 

path P{u,v) from u to v such that one of the following holds : 
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• (i). V{u,v) consists entirely of A-adopted or diffusible A-suspended/potential nodes, and u must be 
A-suspended, or 

• (ii). There exists an A-suspended node u' u on V{u, v), such that all nodes on the sub-path ofV{u, u') 
(excluding u) are B-diffusible and the remaining ones (excluding u' and v) are either A-adopted or 
diffusible A-suspended/potential. 

Proof of Claim 5. (•^=). Suppose (*) holds. Clearly, u will adopt B first as a seed and then adopt A by 
reconsideration. Then all nodes on V{u,v) including v will adopt both A and B since they are either A- 
adopted or diffusible ^-suspended/potential. Consider any node w / v on V{u, v): if ic is already ^-adopted, 
it will adopt B since Qbia = otherwise, w adopts B first since Og < q^iih (diffusible) and then A. Next, 
snppose {ii) holds. Since S13 = {u}, u adopts B, so do all nodes on V{u,u'). Since u' is ^-snspended, it will 
then adopt A by reconsideration. The rest of the argument is exactly the same as the case above. 

(=>). Since v is ^-adopted, there must exist a live-edge path on P 4 from to v snch that all nodes 
on Pj\^ are .4-adopted. Suppose every node w G P 4 have aJ^ < g^| 0 , then all nodes on P 4 wonld adopt A 
when Sb = 0, including v, which is a contradiction. Thns, there exists at least one node w G P 4 snch that 
qA\% < c^A — ^A\B- Let wi be the first of snch nodes, connting from upstream. In order for v to adopt A, wi 
must adopt A, and in order for that to happen, wi mnst adopt B first. 

If the sub-path 'P'{wi,v) does not have non-diffusible nodes, then it satisfies (i) and wi = u as in 
the theorem. Now suppose there is one node W 2 G V'{wi,v) that is non-diffnsible, i.e., > gg |0 and 

( 7 _ 4|0 < of/ < Qaib- This means in order for W 2 to adopt A and B, it mnst be a seed for either or both 
items. By construction, W 2 is not an ^-seed, and thus it must be a S-seed. Next, since wi adopts B and 
W 2 is the only S-seed, there mnst be a live-edge path of K-diffusible nodes from W 2 to wi. In this way, 
V'{w 2 , v) satisfies {ii), with W 2 = u and wi = u' as in the theorem. Also note that if there are more than one 
non-diffusible nodes on V'{wx,v), all of them must be ;B-seeds in order to make v adopt A. Since the theorem 
only considers singleton ;B-seed sets, snch paths will not lead v to adopt A with only one non-diffnsible node 
as ;B-seed. This completes the proof. □ 

By Definition 1, the theorem thus holds true. □ □ 

Lemma 8 In expectation, GeneralTIM with RR-CIM runs in O (j^k -I- f')(|C| -I- \E\) log \V\ ^1 -I- ^^'^epTbs’^° ) 
time, (re-stated). 

Proof. The following inequality continues to hold for RR-CIM, by applying Lemma 4 in [24]. 

|y| • EPTbs/\E\ < OPTk, 

Since we employ lazy sampling (of possible worlds), we conclude that EPTp -|- EPTbs + EPTbo ^ \E\. 
That is, in the process of generating a single RR-set, the worst case is to flip a coin for every edge, and every 
edge becomes live, and the search goes on. The rest of the analysis is similar to Lemma 6 , and finally, the 
expected time complexity for GeneralTIM + RR-CIM is 

o (^{k + e){\v\ + |A|)iog|R| ■ 


□ 


Theorem 9 (re-stated). Sandwich Approximation solution gives: 

^{Ssand) > max {^, • (1 - 1/e) • a{s:), 

where S* is the optimal solution maximizing a (subject to cardinality constraint k). 
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Proof. Let S* and S* be the optimal solution to maximizing fi and v respectively. We have 


^iS.) = ^ ^ • (1 - 1/e) • HSt) 

iy[S^) v[Su) 

> ^ • (1 - 1/e) • ^{SD > ^ ■ (1 - 1/e) • a{Sl)-, (7) 
v(bi,) v{by) 

aiS^) > > (1 - 1 /e) • > (1 - 1 /e) • 

>f^y{l-l/e)-a{S:). ( 8 ) 

The theorem follows by applying Eq. (5), the definition of Sgand- Q 

Theorem 10 (re-stated). Suppose < 7,410 < qA\B < 7 g |0 < < 701 , 4 . Then, under the Com-IC model, 
for any fixed A and B seed sets Sa cind S'g, cr, 4 (<S', 4 , S'g) is monotonically increasing w.r.t. any one of 
{QA\<ii-: Qa\Bi Qi 3 \a} with other three GAPs fixed, as long as after the increase the parameters are still in 

Q+ 

Proof (Sketch). The detailed proof would follow the similar induction proof structure for each possible world 
as in the proof of Theorem 3. Intuitively, we would prove inductively that at every step increasing < 7,410 or 
Qaib would increase both ^-adopted and ;B-adopted nodes. □ 

B.4 Submodularity Analysis for Competitive Cases of Com-IC 

First, we address cross-submodularity. Note that by Theorem 3, a a is monotonically decreasing w.r.t. S'g 
(with any fixed 5 , 4 ). Intuitively, cross-submodularity for competitive products means adding an additional 
;B-seed to a smaller ,B-seed set yields a larger decrease in the spread of A: for any S C T C V and any 
u ^ T, aA{SA, S) — <t, 4 (S', 4 , S U {u}) > <t^(S', 4 , T) — aA{SA, T U {«}). This notion is relevant and useful to 
the problem of influence blocking maximization, where one party wants to find the best seed set to block the 
spread of competitors [5,13]. Since influence blocking is not the focus of this work, from here on, we focus 
on self-submodularity and self-monotonicity only, and hereafter we drop “self-” in the terminologies. 

When g 4|0 = < 7 g |0 = 1 and < 74 |g = qB\A = 0) the Com-IC model degenerates to the homogeneous CIC 
model, for which submodularity and monotonicity both hold [ 8 ]. In the general setting, monotonicity 
holds (Theorem 3). However, the following counter-example shows the opposite for submodularity. 

Example 5. Consider the graph in Figure 12, where all edges have an influence probability of 1. Values of 
Q are: < 7,410 = <7 S (0,1), < 7 ^|g = qB\A = 0, < 7 b |0 = 1. The H-seed set is S'g = {y}. For A-seed set S', 4 , let 
S = {si}, T = {si, S 2 }, and u = S 3 . 

We consider the probability v becomes A-adopted with Sa, denoted by apA(v, Sa) (Sb is omitted since 
it is clear from context): 

apA{v,S) = 0 , 
apAiv, S U {u}) = q^, 
aPAiv,T) = 0 , 

apA{v, T U {ii}) =q'^ + {l-q)- < 7 ®. 


Hence, we have 

{apA{v, T U {u}) - apA{v, T)) - {apA{v, S U {u}) - apA{v, S)) 

= (1 - <7) • <7® > 0. 

Non-submodularity occurs for the entire graph if v is replicated sufficiently many times. □ 
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Figure 12: Graph for Example 5 


The intuition of the above counter-example is that yl-seeds S2 and S3 together can block B completely, 
and thus even if they cannot successfully activate v to adopt A, si could later activate v (with the additional 
probability (1 — g) • g®), but when S2 or S3 acts alone, it cannot block B, so the influence of A from si will 
not reach v. 

Next, we show a positive result which says that submodularity is satisfied as long as g ^|0 = gB |0 = 1- 

Theorem 11. In the Com-IC model, when g ^|0 = q^ith = 1, the influence spread function cr^{S^, Sis) is 
submodular w.r.t. Sa, for any given Sb- 

Proof. We fix a ,B-seed set Sb and consider an arbitrary possible world W to show that submodularity is 
satisfied in W. 

In the possible world W, for an .4-adopted node v, there must exist a live-edge path from .A-seed set Sa to 
V with all nodes on the path adopting A, and we call this path an Al-path (K-path is defined symmetrically). 
With mutual competition, the length of the shortest Al-path from Sa to v is the same as the time step at 
which V adopts A. This is because in competitions when a node is informed about an item, it only has one 
chance at the same time step to decide about the adoption (reconsideration in later steps is not possible). 

Claim 6. If a node v is reachable from Sa or Sb in the possible world W, then v must adopt at least one 
of A and B. 

Proof of Claim 6. This is due to the fact that g ^|0 = gB |0 = 1. In this case, suppose a node v is reachable 
from Sa, i.e. there is a path from Sa to v in the possible world W. The claim can be proven by an induction 
on the path length. Essentially, no matter which item an earlier node on the path adopts, it would inform the 
next node on the path, and since g_ 4|0 = gB |0 = 1, the node would adopt the first item it get informed. □ 

Let S' C T C F be two sets of nodes, and it £ F \ T be an additional yl-seed. For a fixed ,B-seed set Sg 
(omitted from the following notations), let <I>)^(S^) denote the set of .4-adopted nodes in the possible world 
W with .4-seed set Sa after the diffusion ends. Consider a node v £ (T U {it}) \ (T). We need to show 

i;£$J(SU{ii})\$)f(S). 

We now construct a shortest 4l-path, denoted P 4 , from some node in wg £ rujitj to v when Sa = rujitj. 
The path construction is done backwards. Starting from v, if v has only one in-neighbor w attempting to 
inform v through the live edge (open information channel) from w to v, then in the previous time step, w 
must have adopted A, and we select w as the predecessor of v on path Pa- If there are multiple in-neighbors 
that have live edges pointing to v, we examine the permutation iTy and pick w as v’s predecessor on this 
path, such that w ranks first over all of u’s in-neighbors that adopted A in the previous time step. We trace 
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back and stop once we reach an ^-seed in ru{u}. It is clear that P 4 must be a shortest ^-path from ru{u} 
to V, since in every construction step we move back one time step. 

Claim 7. Let wq G TU{rt} be the starting point of P^. Then, even when = {ruo}> nodes on P 4 would 
still adopt A. 

Proof of Claim 7. Suppose, for a contradiction, that some node on P 4 does not adopt A when = {rco}- 
Let w be the first of such nodes on P 4 (counting from wq)^. By Claim 6 , w must adopt S as it does not adopt 
A. This means that there must exist a S-path from S'g to w. We construct a shortest ,B-path Pg from S'g to 
w in the same way as we constructed P 4 : start from w' = w backwards and always select the in-neighbor 
of the current node w' that adopts B and is ordered first in Moreover, since w is informed of A (by its 
predecessor on path P 4 ) but does not adopt A, it holds that g_ 4 |B < in W. 

We now compare the length of Pg, denoted by f'g, to the length of the segment of P _4 from wq to w, 
denoted by Suppose f'g > Then w would still adopt A when = {wq} (regardless of whether it 
will adopt B or not), a contradiction. Next consider £g < When T U {u} is the yl-seed set, from P-seed 
set S'g and through the path Pg, every time step one more node on path Pg has to adopt either A or B (by 
Claim 6 ). Thus by time step £g, w adopts either A or B. But since P _4 is the shortest M-path from T U {u} 
to w and > ^g, w cannot adopt A at step £g, so w must adopt B at step £g. Since > Qaibi if means 
that w would not adopt A after adopting B, also a contradiction. 

We are left with the case is = (-a- Let wa and uig be the predecessor of w on Pa and Pg, respectively. 
Assume wa 7 ^ wb (we deal with wa = WB later). If wa is ordered ahead of wg in tt^,, then according to 
the tie-breaking rule, w would be informed of A first and adopt A (since < < 7 , 410 ) when wq is the only 
M-seed, contradicting the definition of w. 

If wi3 is ordered ahead of wa in tTw, then consider again the scenario when Sa = T U {u}. By Claim 6 , 
through the path Pg, wb adopts either A or P by step £g — 1. If wg adopts A, it contradicts the construction 
of Pa since we would have chosen wb instead of wa in the backward construction. If wb adopts B, then 
w would be informed of B from wb first, and then due to > < 7 , 4 |g, w would not adopt A, again a 
contradiction. 

Finally, we also need to consider the case of wa = wb- Since w does not adopt A when wg is the only 
M-seed, according to our tie-breaking rule, node wa must adopt B first and then adopt A at the same step. 
We then trace back the predecessor of wa on path P ,4 and Pb respectively. If these two paths never branch 
backward, i.e. P ,4 = Pb, then we know that wg is both A and P-seed. According to the possible world model 
we would use Tw„ to decide whether wg adopts A or B first, and then all nodes on the path P ,4 = Pb would 
follow the same order. Since wa adopts B first before A, we know that is such that B is ranked ahead 
of A. 

Now we consider the scenario when Sa = T U {u}. By the construction of path Pa, the successor of wg 
on this path, wi, orders wg first among all A-adopted in-neighbors, so wi would take wg to get informed of 
A, but since wg adopts B first, this implies that wi also adopts B first. We can apply the same argument 
along the path P 4 to see that w adopts B first before A. However, since > < 7 , 4 |g, we know that w would 
not adopt A, a contradiction. 

If paths Pa and Pb branch at some node x when we trace backward, then let xa and xb are the two 
predecessors of x on paths Pa and Pb respectively, and xa 7 ^ xb- When wg is the only A-seed, we know 
from above that wa adopts B first before adopting A. By the construction of Pg, wa must be informed of 
B from its predecessor on Pb, so if this predecessor has not branched yet, it must also adopts B before A. 
Using the same argument, we know that x must adopt B first before adopting A. Then we know that xg 
must be ordered before xa in tTx- Now consider the scenario when Sa = T U {u}. The argument is exactly 
the same as the previous case of wb ordered before wa, and we also reach a contradiction. 

We have exhausted all cases and Claim 7 is proven. □ 

Claim 7 immediately implies that wg = u, since otherwise, we have wg G T, and by the proof of 
monotonicity (Theorem 3) we know that v G <i>)^({r(;o}) C This contradicts with the definition 

^ Wg, since as a seed, wg adopts automatically. 
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ofve $J(TUM)\$^(T). 

Again by monotonicity, we have v € ‘h^d'u}) C $^(5” U {m}). Since S C T and v ^ then 

V ^ (S'). This gives v € $^4 (S U {«}) \ (S), which was to be shown. □ 
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